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HUMAN GPCR PROTEINS 

TECHNICAL FIELD 

This invention relates to nucleic acid and amino acid sequences of human GPCR proteins 
and to the use of these sequences in the diagnosis, treatment, and prevention of cell proliferative, 
5 neurological, and immune disorders. 

BACKGROUND OF THE INVENTION 

The term receptor describes proteins that specifically recognize other molecules. The 
category is broad and includes proteins with a variety of functions. The bulk of the proteins 

10 termed receptors are cell surface proteins which bind extracellular ligands, leading to cellular 
responses including growth, differentiation, endocytosis, and immune response. Other proteins 
termed receptors facilitate the specific transport of proteins across the endoplasmic reticulum 
membrane and localize enzymes to a particular location in the cell. 

G protein coupled receptors (GPCR) are a superfamiiy of integral membrane proteins 

1 5 which transduce extracellular signals. GPCRs include receptors for biogenic amines; for lipid 
mediators of inflammation, peptide hormones, and sensory signal mediators. The GPCR becomes 
activated when the receptor binds its extracellular ligand. Conformational changes in the GPCR, 
which result from the ligand-receptor interaction, affect the binding affinity of a G protein to the 
GPCR intracellular domains. This enables GTP to bind with enhanced affinity to the G protein. 

20 Activation of the G protein by GTP leads to the interaction of the G protein a subunit with 

adenylate cyclase or other second messenger molecule generators. This interaction regulates the 
activity of adenylate cyclase and hence production of a second messenger molecule, cAMP. 
cAMP regulates phosphorylation and activation of other intracellular proteins. Alternatively, 
cellular levels of other second messenger molecules, such as cGMP or eicosinoids, may be 
25 upregulated or downregulated by the activity of GPCRs. The G protein a subunit is deactivated by 
hydrolysis of the GTP by GTPase, and the p, y, and a subunits reassociate. The heterotrimeric G 
protein then dissociates from the adenylate cyclase or other second messenger molecule generator. 
Activity of GPCR may also be regulated by phosphorylation of the intra- and extracellular 
domains or loops. 

30 Visual excitation and the phototransmission of light signals is a signaling cascade in which 

GPCRs play an important role. The process begins in retinal rod cells with the absorption of light 
by the photoreceptor rhodopsin, a GPCR composed of a 40-kDa protein, opsin, and a 
chromophore, 1 1-cis-retinal. The photoisomerization of the retinal chromophore causes a 

1 
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conformational change in the opsin GPCR and activation of the associated G-protein, transducin. 
This activation leads to the hydrolysis of cyclic-GMP and the closure of cyclic-GMP regulated, 
Ca^specific channels in the plasma membrane of the rod cell. The resultant membrane 
hyperpoiarization generates a nerve signal. Recovery of the dark state of the rod cell involves the 
5 activation of guanylate cyclase leading to increased cyclic-GMP levels and the reopening of the 
Ca^-specific channels (L. Stryer (1991) J. Biol. Chem. 266:1071 1-10714). 

Glutamate receptors form a group of GPCRs that are important in neurotransmission. 
Glutamate is the major neurotransmitter in the CNS and is believed to have important roles in 
neuronal plasticity, cognition, memory, learning and some neurological disorders such as epilepsy, 

10 stroke, and neurodegeneration (Watson, S. and S. Arkinstall (1994) The G-Protein Linked 

Rece ptor Facts Book, Academic Press, San Diego CA, pp. 130-132). . These effects of glutamate 
are mediated by two distinct classes of receptors termed ionotropic and metabotropic. lonotropic 
receptors contain an intrinsic cation channel and mediate fast, excitatory actions of glutamate. 
Metabotropic receptors are modulatory, increasing the membrane excitability of neurons by 

1 5 inhibiting calcium dependent potassium conductances and both inhibiting and potentiating 
excitatory transmission of ionotropic receptors. Metabotropic receptors are classified into five 
subtypes based on agonist pharmacology and signal transduction pathways and are widely 
distributed in brain tissues. 

The vasoactive intestinal polypeptide (VIP) family is a group of related polypeptides 

20 whose actions are also mediated by GPCRs. Key members of this family are VIP itself, secretin, 
and growth hormone releasing factor (GRF). VIP has a wide profile of physiological actions 
including relaxation of smooth muscles, stimulation or inhibition of secretion in various tissues, 
modulation of various immune cell activities, and various excitatory and inhibitory activities in the 
CNS. Secretin stimulates secretion of enzymes and ions in the pancreas and intestine and is also 

25 present in small amounts in the brain. GRF is an important neuroendocrine agent regulating 
synthesis and release of growth hormone from the anterior pituitary (Watson, S. and S. Arkinstall 

supra, pp. 278-283). 

The structure of GPCRs is highly-conserved and consists of seven hydrophobic 
transmembrane (serpentine) regions, cysteine disulfide bridges between the second and third 
30 extracellular loops, an extracellular N-terminus, and a cytoplasmic C-terminus. Three 

extracellular loops alternate with three intracellular loops to link the seven transmembrane regions. 
The most conserved parts of these proteins are the transmembrane regions and the first two 
cytoplasmic loops. A conserved, acidic- Arg-aromatic residue triplet present in the second 
cytoplasmic loop may interact with the G-proteins. The consensus pattern of the G-protein 



WO. 00/1 5793 



PCT/US99/209S8 



coupled receptors signature (PS00237; SWISSPROT) is characteristic of most proteins belonging 
to this superfamily (Watson, S. and S. Arkinstall supra, pp. 2-6). 

The discovery of new human GPCR proteins and the polynucleotides encoding them 
satisfies a need in the art by providing new compositions which are useful in the diagnosis, 
5 prevention, and treatment of cell proliferative, neurological, and immune disorders. 

SUMMARY OF THE INVENTION 

The invention features substantially purified polypeptides, human GPCR proteins, referred 
to collectively as "HGPRP". In one aspect, the invention provides a substantially purified 

1 0 polypeptide comprising an amino acid sequence selected from the group consisting of SEQ ID 
NO: 1-6, and fragments thereof. 

The invention further provides a substantially purified variant having at least 90% amino 
acid identity to at least one of the amino acid sequences selected from the group consisting of SEQ 
ID NO: 1-6, and fragments thereof. The invention also provides an isolated and purified 

1 5 polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-6, and fragments thereof. The invention also includes an 
isolated and purified polynucleotide variant having at least 70% polynucleotide sequence identity 
to the polynucleotide encoding the polypeptide comprising an amino acid sequence selected from 
the group consisting of SEQ ID NO: 1-6, and fragments thereof. 

20 Additionally, the invention provides an isolated and purified polynucleotide which 

hybridizes under stringent conditions to the polynucleotide encoding the polypeptide comprising 
an amino acid sequence selected from the group consisting of SEQ ID NO: 1-6, and fragments 
thereof. The invention also provides an isolated and purified polynucleotide having a sequence 
which is complementary to the polynucleotide encoding the polypeptide comprising the amino 

25 acid sequence selected from the group consisting of SEQ ID NO: 1 -6, and fragments thereof. 
The invention also provides an isolated and purified polynucleotide comprising a 
polynucleotide sequence selected from the group consisting of SEQ ID NO:7-12, and fragments 
thereof. The invention further provides an isolated and purified polynucleotide variant having at 
least 70% polynucleotide sequence identity to the polynucleotide sequence selected from the 

30 group consisting of SEQ ID NO:7-12, and fragments thereof. The invention also provides an 
isolated and purified polynucleotide having a sequence which is complementary to the 
polynucleotide comprising a polynucleotide sequence selected from the group consisting of SEQ 
ID NO:7-12, and fragments thereof. 

The invention also provides a method for detecting a polynucleotide in a sample 

35 containing nucleic acids, the method comprising the steps of: (a) hybridizing the complement of 
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the polynucleotide sequence to at least one of the polynucleotides of the sample, thereby forming a 
hybridization complex; and (b) detecting the hybridization complex, wherein the presence of the 
hybridization complex correlates with the presence of a polynucleotide in the sample. In one 
aspect, the method further comprises amplifying the polynucleotide prior to hybridization. 
5 The invention further provides an expression vector containing at least a fragment of the 

polynucleotide encoding the polypeptide comprising an amino acid sequence selected from the 
group consisting of SEQ ID NO: 1-6, and fragments thereof. In another aspect, the expression 
vector is contained within a host cell. 

The invention also provides a method for producing a polypeptide, the method comprising 
10 the steps of: (a) culturing the host cell containing an expression vector containing at least a 

fragment of a polynucleotide under conditions suitable for the expression of the polypeptide; and 
(b) recovering the polypeptide from the host cell culture. 

The invention also provides a pharmaceutical composition comprising a substantially 
purified polypeptide having the amino acid sequence selected from the group consisting of SEQ 
1 5 ID NO: 1-6, and fragments thereof, in conjunction with a suitable pharmaceutical carrier. 

The invention further includes a purified antibody which binds to a polypeptide selected 
from the group consisting of SEQ ID NO: 1-6, and fragments thereof. The invention also provides 
a purified agonist and a purified antagonist to the polypeptide. 

The invention also provides a method for treating or preventing a disorder associated with 
20 decreased expression or activity of HGPRP, the method comprising administering to a subject in 
need of such treatment an effective amount of a pharmaceutical composition comprising a 
substantially purified polypeptide having the amino acid sequence selected from the group 
consisting of SEQ ID NO: 1-6, and fragments thereof, in conjunction with a suitable 
pharmaceutical carrier. 

25 The invention also provides a method for treating or preventing a disorder associated with 

increased expression or activity of HGPRP, the method comprising administering to a subject in 
need of such treatment an effective amount of an antagonist of a polypeptide having an amino acid 
sequence selected from the group consisting of SEQ ID NO: 1-6, and fragments thereof. 

3 o BRIEF DESCRIPTION OF THE TABLES 

Table 1 shows nucleotide and polypeptide sequence identification numbers (SEQ ID NO), 
clone identification numbers (clone ID), cDNA libraries, and cDNA fragments used to assemble 
full-length sequences encoding HGPRP. 

Table 2 shows features of each polypeptide sequence including potential motifs, 
35 homologous sequences, and methods and algorithms used for identification of HGPRP. 
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Table 3 shows the tissue-specific expression patterns of each nucleic acid sequence as 
determined by northern analysis, conditions, diseases or disorders associated with these tissues, 
and the vector into which each cDNA was cloned. 

Table 4 describes the tissues used to construct the cDNA libraries from which Incyte 
5 clones encoding HGPRP were isolated. 

Table 5 shows the programs, their descriptions, references, and threshold parameters used 

to analyze HGPRP. 

DESCRIPTION OF THE INVENTION 

1 0 Before the present proteins, nucleotide sequences, and methods are described, it is 

understood that this invention is not limited to the particular machines, materials and methods 
described, as these may vary. It is also to be understood that the terminology used herein is for the 
purpose of describing particular embodiments only, and is not intended to limit the scope of the 
present invention which will be limited only by the appended claims. 

1 5 It must be noted that, as used herein and in the appended claims, the singular forms "a," 

"an," and "the" include plural reference unless the context clearly dictates otherwise. Thus, for 
example, a reference to "a host cell" includes a plurality of such host cells, and a reference to "an 
antibody" is a reference to one or more antibodies and equivalents thereof known to those skilled 
in the art, and so forth. 

20 Unless defined otherwise, all technical and scientific terms used herein have the same 

meanings as commonly understood by one of ordinary skill in the art to which this invention 
belongs. Although any machines, materials, and methods similar or equivalent to those described 
herein can be used to practice or test the present invention, the preferred machines, materials and 
methods are now described. All publications mentioned herein are cited for the purpose of 

25 describing and disclosing the cell lines, protocols, reagents and vectors which are reported in the 
publications and which might be used in connection with the invention. Nothing herein is to be 
construed as an admission that the invention is not entitled to antedate such disclosure by virtue of 
prior invention. 
DEFINITIONS 

30 "HGPRP" refers to the amino acid sequences of substantially purified HGPRP obtained 

from any species, particularly a mammalian species, including bovine, ovine, porcine, murine, 
equine, and preferably the human species, from any source, whether natural, synthetic, 
semi-synthetic, or recombinant. 

The term "agonist" refers to a molecule which, when bound to HGPRP, increases or 
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prolongs the duration of the effect of HGPRP. Agonists may include proteins, nucleic acids, 
carbohydrates, or any other molecules which bind to and modulate the effect of HGPRP. 

An "allelic variant" is an alternative form of the gene encoding HGPRP. Allelic variants 
may result from at least one mutation in the nucleic acid sequence and may result in altered 
5 mRNAs or in polypeptides whose structure or function may or may not be altered. Any given 
natural or recombinant gene may have none, one, or many allelic forms. Common mutational 
changes which give rise to allelic variants are generally ascribed to natural deletions, additions, or 
substitutions of nucleotides. Each of these types of changes may occur alone, or in combination 
with the others, one or more times in a given sequence. 

1 0 "Altered" nucleic acid sequences encoding HGPRP include those sequences with 

deletions, insertions, or substitutions of different nucleotides, resulting in a polynucleotide the 
same as HGPRP or a polypeptide with at least one functional characteristic of HGPRP. Included 
within this definition are polymorphisms which may or may not be readily detectable using a 
particular oligonucleotide probe of the polynucleotide encoding HGPRP, and improper or 

1 5 unexpected hybridization to allelic variants, with a locus other than the normal chromosomal locus 
for the polynucleotide sequence encoding HGPRP. The encoded protein may also be "altered," 
and may contain deletions, insertions, or substitutions of amino acid residues which produce a 
silent change and result in a functionally equivalent HGPRP. Deliberate amino acid substitutions 
may be made on the basis of similarity in polarity, charge, solubility, hydrophobicity, 

20 hydrophilicity, and/or the amphipathic nature of the residues, as long as the biological or 

immunological activity of HGPRP is retained. For example, negatively charged amino acids may 
include aspartic acid and glutamic acid, positively charged amino acids may include lysine and 
arginine, and amino acids with uncharged polar head groups having similar hydrophilicity values 
may include leucine, isoleucine, and valine; glycine and alanine; asparagine and glutamine; serine 

25 and threonine; and phenylalanine and tyrosine. 

The terms "amino acid" or "amino acid sequence" refer to an oligopeptide, peptide, 
polypeptide, or protein sequence, or a fragment of any of these, and to naturally occurring or 
synthetic molecules. In this context, "fragments," "immunogenic fragments," or "antigenic 
fragments" refer to fragments of HGPRP which are preferably at least 5 to about 1 5 amino acids in 

30 length, most preferably at least 14 amino acids, and which retain some biological activity or 
immunological activity of HGPRP. Where "amino acid sequence" is recited to refer to an amino 
acid sequence of a naturally occurring protein molecule, "amino acid sequence" and like terms are 
not meant to limit the amino acid sequence to the complete native amino acid sequence associated 
with the recited protein molecule. 
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"Amplification" relates to the production of additional copies of a nucleic acid sequence. 
Amplification is generally carried out using polymerase chain reaction (PCR) technologies well 
known in the art. 

The term ''antagonist' 5 refers to a molecule which, when bound to HGPRP, decreases the 
5 amount or the duration of the effect of the biological or immunological activity of HGPRP. 

Antagonists may include proteins, nucleic acids, carbohydrates, antibodies, or any other molecules 
which decrease the effect of HGPRP. 

The term "antibody" refers to intact molecules as well as to fragments thereof, such as 
Fab, F(ab% and Fv fragments, which are capable of binding the epitopic determinant. Antibodies 
10 that bind HGPRP polypeptides can be prepared using intact polypeptides or using fragments 
containing small peptides of interest as the immunizing antigen. The polypeptide or oligopeptide 
used to immunize an animal (e.g., a mouse, a rat, or a rabbit) can be derived from the translation of 
RNA, or synthesized chemically, and can be conjugated to a carrier protein if desired. Commonly 
used carriers that are chemically coupled to peptides include bovine serum albumin, thyroglobulin, 
1 5 and keyhole limpet hemocyanin (KLH). The coupled peptide is then used to immunize the animal. 
The term "antigenic determinant" refers to that fragment of a molecule (i.e., an epitope) 
that makes contact with a particular antibody. When a protein or a fragment of a protein is used to 
immunize a host animal, numerous regions of the protein may induce the production of antibodies 
which bind specifically to antigenic determinants (given regions or three-dimensional structures on 
20 the protein). An antigenic determinant may compete with the intact antigen (i.e., the immunogen 
used to elicit the immune response) for binding to an antibody. 

The term "antisense" refers to any composition containing a nucleic acid sequence which 
is complementary to the "sense" strand of a specific nucleic acid sequence. Antisense molecules 
may be produced by any method including synthesis or transcription. Once introduced into a cell, 
25 the complementary nucleotides combine with natural sequences produced by the cell to form 
duplexes and to block either transcription or translation. The designation "negative" can refer to 
the antisense strand, and the designation "positive" can refer to the sense strand. 

The term "biologically active" refers to a protein having structural, regulatory, or 
biochemical functions of a naturally occurring molecule. Likewise, "immunologically active ?? 
30 refers to the capability of the natural, recombinant, or synthetic HGPRP, or of any oligopeptide 
thereof, to induce a specific immune response in appropriate animals or cells and to bind with 
specific antibodies. 

The terms "complementary" or "complementarity" refer to the natural binding of 
polynucleotides by base pairing. For example, the sequence "S' A-G-T 3"' bonds to the 
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complementary sequence "3' T-C-A 5'." Complementarity between two single-stranded molecules 
may be "partial," such that only some of the nucleic acids bind, or it may be "complete," such that 
total complementarity exists between the single stranded molecules. The degree of 
complementarity between nucleic acid strands has significant effects on the efficiency and strength 

5 of the hybridization between the nucleic acid strands. This is of particular importance in 
amplification reactions, which depend upon binding between nucleic acids strands, and in the 
design and use of peptide nucleic acid (PNA) molecules. 

A "composition comprising a given polynucleotide sequence" or a "composition 
comprising a given amino acid sequence" refer broadly to any composition containing the given 

10 polynucleotide or amino acid sequence. The composition may comprise a dry formulation or an 
aqueous solution. Compositions comprising polynucleotide sequences encoding HGPRP or 
fragments of HGPRP may be employed as hybridization probes. The probes may be stored in 
freeze-dried form and may be associated with a stabilizing agent such as a carbohydrate. In 
hybridizations, the probe may be deployed in an aqueous solution containing salts (e.g., NaCl), 

1 5 detergents (e.g., sodium dodecyl sulfate; SDS), and other components (e.g., Denhardt's solution, 
dry milk, salmon sperm DNA, etc.). 

"Consensus sequence" refers to a nucleic acid sequence which has been resequenced to 
resolve uncalled bases, extended using the XL-PCR kit (PE Biosystems, Foster City CA) in the 5' 
and/or the 3' direction, and resequenced, or which has been assembled from the overlapping 

20 sequences of more than one Incyte Clone using a computer program for fragment assembly, such 
as the GELVIEW Fragment Assembly system (GCG, Madison WI). Some sequences have been 
both extended and assembled to produce the consensus sequence. 

The term "correlates with expression of a polynucleotide" indicates that the detection of 
the presence of nucleic acids, the same or related to a nucleic acid sequence encoding HGPRP, by 

25 northern analysis is indicative of the presence of nucleic acids encoding HGPRP in a sample, and 
thereby correlates with expression of the transcript from the polynucleotide encoding HGPRP. 

A "deletion" refers to a change in the amino acid or nucleotide sequence that results in the 
absence of one or more amino acid residues or nucleotides. 

The term "derivative" refers to the chemical modification of a polypeptide sequence, or a 

30 polynucleotide sequence. Chemical modifications of a polynucleotide sequence can include, for 
example, replacement of hydrogen by an alkyl, acyl, or amino group. A derivative polynucleotide 
encodes a polypeptide which retains at least one biological or immunological function of the 
natural molecule. A derivative polypeptide is one modified by glycosylation, pegylation, or any 
similar process that retains at least one biological or immunological function of the polypeptide 
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from which it was derived. 

The term "similarity" refers to a degree of complementarity. There may be partial 
similarity or complete similarity. The word "identity" may substitute for the word "similarity". A 
partially complementary sequence that at least partially inhibits an identical sequence from 
5 hybridizing to a target nucleic acid is referred to as "substantially similar." The inhibition of 
hybridization of the completely complementary sequence to the target sequence may be examined 
using a hybridization assay (Southern or northern blot, solution hybridization, and the like) under 
conditions of reduced stringency. A substantially similar sequence or hybridization probe will 
compete for and inhibit the binding of a completely similar (identical) sequence to the target 

1 0 sequence under conditions of reduced stringency. This is not to say that conditions of reduced 
stringency are such that non-specific binding is permitted, as reduced stringency conditions 
require that the binding of two sequences to one another be a specific (i.e., a selective) interaction. 
The absence of non-specific binding may be tested by the use of a second target sequence which 
lacks even a partial degree of complementarity (e.g., less than about 30% similarity or identity). 

1 5 In the absence of non-specific binding, the substantially similar sequence or probe will not 
hybridize to the second non-complementary target sequence. 

The phrases "percent identity" or "% identity" refer to the percentage of sequence 
similarity found in a comparison of two or more amino acid or nucleic acid sequences. Percent 
identity can be determined electronically, e.g., by using the MEGALIGN program (DNASTAR, 

20 Madison WI) which creates alignments between two or more sequences according to methods 
selected by the user, e.g., the clustal method. (See, e.g., Higgins, D.G. and P.M. Sharp (1988) 
Gene 73:237-244.) The clustal algorithm groups sequences into clusters by examining the 
distances between all pairs. The clusters are aligned pairwise and then in groups. The percentage 
similarity between two amino acid sequences, e.g., sequence A and sequence B, is calculated by 

25 dividing the length of sequence A, minus the number of gap residues in sequence A, minus the 
number of gap residues in sequence B, into the sum of the residue matches between sequence A 
and sequence B, times one hundred. Gaps of low or of no similarity between the two amino acid 
sequences are not included in determining percentage similarity. Percent identity between nucleic 
acid sequences can also be counted or calculated by other methods known in the art, e.g., the Jotun 

30 Hein method. (See, e.g., Hein, J. (1990) Methods Enzymol. 183:626-645.) Identity between 
sequences can also be determined by other methods known in the art, e.g., by varying 
hybridization conditions. 

"Human artificial chromosomes" (HAC's) are linear microchromosomes which may 
contain DNA sequences of about 6 kb to 10 Mb in size, and which contain all of the elements 
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required for stable mitotic chromosome segregation and maintenance. 

The term "humanized antibody" refers to antibody molecules in which the amino acid 
sequence in the non-antigen binding regions has been altered so that the antibody more closely 
resembles a human antibody, and still retains its original binding ability. 
5 "Hybridization" refers to any process by which a strand of nucleic acid binds with a 

complementary strand through base pairing. 

The term "hybridization complex" refers to a complex formed between two nucleic acid 
sequences by virtue of the formation of hydrogen bonds between complementary bases. A 
hybridization complex may be formed in solution (e.g., C 0 t or Rot analysis) or formed between one 
10 nucleic acid sequence present in solution and another nucleic acid sequence immobilized on a 
solid support (e.g., paper, membranes, filters, chips, pins or glass slides, or any other appropriate 
substrate to which cells or their nucleic acids have been fixed). 

The words "insertion" or "addition" refer to changes in an amino acid or nucleotide 
sequence resulting in the addition of one or more amino acid residues or nucleotides, respectively, 
1 5 to the sequence found in the naturally occurring molecule. 

"Immune response" can refer to conditions associated with inflammation, trauma, immune 
disorders, or infectious or genetic disease, etc. These conditions can be characterized by 
expression of various factors, e.g., cytokines, chemokines, and other signaling molecules, which 
may affect cellular and systemic defense systems. 
20 The term "microarray" refers to an arrangement of distinct polynucleotides on a substrate. 

The terms "element" or "array element" in a microarray context, refer to hybridizable 
polynucleotides arranged on the surface of a substrate. 

The term "modulate" refers to a change in the activity of HGPRP. For example, 
modulation may cause an increase or a decrease in protein activity, binding characteristics, or any 
25 other biological, functional, or immunological properties of HGPRP. 

The phrases "nucleic acid" or "nucleic acid sequence," as used herein, refer to a 
nucleotide, oligonucleotide, polynucleotide, or any fragment thereof. These phrases also refer to 
DNA or RN A of genomic or synthetic origin which may be single-stranded or double-stranded 
and may represent the sense or the antisense strand, to peptide nucleic acid (PNA), or to any 
30 DNA-like or RNA-like material. In this context, "fragments" refers to those nucleic acid 

sequences which comprise a region of unique polynucleotide sequence that specifically identifies 
SEQ ID NO:7- 1 2, for example, as distinct from any other sequence in the same genome. For 
example, a fragment of SEQ ID NO:7-12 is useful in hybridization and amplification technologies 
and in analogous methods that distinguish SEQ IDNO:7-12 from related polynucleotide 



10 



WO 00/15793 



PCT/US99/20958 



sequences. A fragment of SEQ ID NO:7-12 is at least about 15-20 nucleotides in length. The 
precise length of the fragment of SEQ ID NO:7-12 and the region of SEQ IDNO:7-12 to which 
the fragment corresponds are routinely determinable by one of ordinary skill in the art based on 
the intended purpose for the fragment. In some cases, a fragment, when translated, would produce 
5 polypeptides retaining some functional characteristic, e.g., antigenicity, or structural domain 
characteristic, e.g., ATP-binding site, of the full-length polypeptide. 

The terms "operably associated" or "operably linked" refer to functionally related nucleic 
acid sequences. A promoter is operably associated or operably linked with a coding sequence if 
the promoter controls the translation of the encoded polypeptide. While operably associated or 
10 operably linked nucleic acid sequences can be contiguous and in the same reading frame, certain 
genetic elements, e.g., repressor genes, are not contiguously linked to the sequence encoding the 
polypeptide but still bind to operator sequences that control expression of the polypeptide. 

The term "oligonucleotide" refers to a nucleic acid sequence of at least about 6 
nucleotides to 60 nucleotides, preferably about 15 to 30 nucleotides, and most preferably about 20 
15 to 25 nucleotides, which can be used in PCR amplification or in a hybridization assay or 
microarray. "Oligonucleotide" is substantially equivalent to the terms "amplimer," "primer," 
"oligomer," and "probe," as these terms are commonly defined in the art. 

"Peptide nucleic acid" (PNA) refers to an antisense molecule or anti-gene agent which 
comprises an oligonucleotide of at least about 5 nucleotides in length linked to a peptide backbone 
20 of amino acid residues ending in lysine. The terminal lysine confers solubility to the composition. 
PNAs preferentially bind complementary single stranded DNA or RNA and stop transcript 
elongation, and may be pegylated to extend their lifespan in the cell. 

The term "sample" is used in its broadest sense. A sample suspected of containing nucleic 
acids encoding HGPRP, or fragments thereof, or HGPRP itself, may comprise a bodily fluid; an 
25 extract from a cell, chromosome, organelle, or membrane isolated from a cell; a cell; genomic 
DNA, RNA, or cDNA, in solution or bound to a substrate; a tissue; a tissue print; etc. 

The terms "specific binding" or "specifically binding" refer to that interaction between a 
protein or peptide and an agonist, an antibody, or an antagonist. The interaction is dependent upon 
the presence of a particular structure of the protein, e.g., the antigenic determinant or epitope, 
30 recognized by the binding molecule. For example, if an antibody is specific for epitope "A," the 
presence of a polypeptide containing the epitope A, or the presence of free unlabeled A, in a 
reaction containing free labeled A and the antibody will reduce the amount of labeled A that binds 
to the antibody. 

The term "stringent conditions" refers to conditions which permit hybridization between 
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polynucleotides and the claimed polynucleotides. Stringent conditions can be defined by salt 
concentration, the concentration of organic solvent, e.g., formamide, temperature, and other 
conditions well known in the art. In particular, stringency can be increased by reducing the 
concentration of salt, increasing the concentration of formamide, or raising the hybridization 
5 temperature. 

The term "substantially purified" refers to nucleic acid or amino acid sequences that are 
removed from their natural environment and are isolated or separated, and are at least about 60% 
free, preferably about 75% free, and most preferably about 90% free from other components with 
which they are naturally associated. 

10 A "substitution" refers to the replacement of one or more amino acids or nucleotides by 

different amino acids or nucleotides, respectively. 

"Substrate" refers to any suitable rigid or semi-rigid support including membranes, filters, 
chips, slides, wafers, fibers, magnetic or nonmagnetic beads, gels, tubing, plates, polymers, 
microparticles and capillaries. The substrate can have a variety of surface forms, such as wells, 

1 5 trenches, pins, channels and pores, to which polynucleotides or polypeptides are bound. 

"Transformation" describes a process by which exogenous DNA enters and changes a 
recipient cell. Transformation may occur under natural or artificial conditions according to 
various methods well known in the art, and may rely on any known method for the insertion of 
foreign nucleic acid sequences into a prokaryotic or eukaryotic host cell. The method for 

20 transformation is selected based on the type of host cell being transformed and may include, but is 
not limited to, viral infection, electroporation, heat shock, lipofection, and particle bombardment. 
The term "transformed" cells includes stably transformed cells in which the inserted DNA is 
capable of replication either as an autonomously replicating plasmid or as part of the host 
chromosome, as well as transiently transformed cells which express the inserted DNA or RNA for 

25 limited periods of time. 

A "variant" of HGPRP polypeptides refers to an amino acid sequence that is altered by 
one or more amino acid residues. The variant may have "conservative" changes, wherein a 
substituted amino acid has similar structural or chemical properties (e.g., replacement of leucine 
with isoleucine). More rarely, a variant may have "nonconservative" changes (e.g., replacement 

30 of glycine with tryptophan). Analogous minor variations may also include amino acid deletions or 
insertions, or both. Guidance in determining which amino acid residues may be substituted, 
inserted, or deleted without abolishing biological or immunological activity may be found using 
computer programs well known in the art, for example, LASERGENE software (DNASTAR). 
The term "variant", when used in the context of a polynucleotide sequence, may 
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encompass a polynucleotide sequence related to HGPRP. This definition may also include, for 
example, "allelic" (as defined above), "splice," "species," or "polymorphic" variants. A splice 
variant may have significant identity to a reference molecule, but will generally have a greater or 
lesser number of polynucleotides due to alternate splicing of exons during mRNA processing. The 
5 corresponding polypeptide may possess additional functional domains or an absence of domains. 
Species variants are polynucleotide sequences that vary from one species to another. The resulting 
polypeptides generally will have significant amino acid identity relative to each other. A 
polymorphic variant is a variation in the polynucleotide sequence of a particular gene between 
individuals of a given species. Polymorphic variants also may encompass "single nucleotide 
10 polymorphisms" (SNPs) in which the polynucleotide sequence varies by one base. The presence 
of SNPs may be indicative of, for example, a certain population, a disease state, or a propensity for 
a disease state. 
THE INVENTION 

The invention is based on the discovery of new human GPCR proteins (HGPRP), the 

1 5 polynucleotides encoding HGPRP, and the use of these compositions for the diagnosis, treatment, 
or prevention of cell proliferative, neurological, and immune disorders. 

Table 1 lists the Incyte Clones used to derive full length nucleotide sequences encoding 
HGPRP. Columns 1 and 2 show the sequence identification numbers (SEQ ID NO) of the amino 
acid and nucleic acid sequences, respectively. Column 3 shows the Clone ID of the Incyte Clone 

20 in which nucleic acids encoding each HGPRP were identified, and column 4, the cDNA libraries 
from which these clones were isolated. Column 5 shows Incyte clones, their corresponding cDNA 
libraries, and shotgun sequences. The clones and shotgun sequences are part of the consensus 
nucleotide sequence of each HGPRP and are useful as fragments in hybridization technologies. 
The columns of Table 2 show various properties of the polypeptides of the invention: 

25 column 1 references the SEQ ID NO; column 2 shows the number of amino acid residues in each 
polypeptide; column 3, potential phosphorylation sites; column 4, potential glycosylation sites; 
column 5, the amino acid residues comprising signature sequences and motifs; column 6, the 
identity of each protein; and column 7, analytical methods used to identify each protein through 
sequence homology and protein motifs. 

30 The columns of Table 3 show the tissue-specificity and diseases, disorders, or conditions 

associated with nucleotide sequences encoding HGPRP. The first column of Table 3 lists the 
polypeptide sequence identifiers. The second column lists tissue categories which express HGPRP 
as a fraction of total tissue categories expressing HGPRP. The third column lists the diseases, 
disorders, or conditions associated with those tissues expressing HGPRP. The fourth column lists 
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the vectors used to subclone the cDNA library. 

The following fragments of the nucleotide sequences encoding HGPRP are useful in 
hybridization or amplification technologies to identify SEQ ID NO:7-12 and to distinguish 
between SEQ ID NO:7-12 and related polynucleotide sequences. The useful fragments are the 
5 fragment of SEQ ID NO:7 from about nucleotide 235 to about nucleotide 270; the fragment of 
SEQ ID NO: 8 from about nucleotide 2 1 8 to about nucleotide 247; the fragment of SEQ ID NO:9 
from about nucleotide 27 1 to about nucleotide 300; the fragment of SEQ ID NO: 1 0 from about 
nucleotide 273 to about nucleotide 303; the fragment of SEQ ID NO: 1 1 from about nucleotide 542 
to about nucleotide 571; and the fragment of SEQ ID NO: 12 from about nucleotide 703 to about 
10 nucleotide 735. 

The invention also encompasses HGPRP variants. A preferred HGPRP variant is one 
which has at least about 80%, more preferably at least about 90%, and most preferably at least 
about 95% amino acid sequence identity to the HGPRP amino acid sequence, and which contains 
at least one functional or structural characteristic of HGPRP. 

1 5 The invention also encompasses polynucleotides which encode HGPRP. In a particular 

embodiment, the invention encompasses a polynucleotide sequence comprising a sequence 
selected from the group consisting of SEQ ID NO:7-12, which encodes HGPRP. 

The invention also encompasses a variant of a polynucleotide sequence encoding HGPRP. 
In particular, such a variant polynucleotide sequence will have at least about 70%, more preferably 

20 at least about 85%, and most preferably at least about 95% polynucleotide sequence identity to the 
polynucleotide sequence encoding HGPRP. A particular aspect of the invention encompasses a 
variant of a polynucleotide sequence comprising a sequence selected from the group consisting of 
SEQ ID NO:7-12 which has at least about 70%. more preferably at least about 85%, and most 
preferably at least about 95% polynucleotide sequence identity to a nucleic acid sequence selected 

25 from the group consisting of SEQ ID NO:7- 12. Any one of the polynucleotide variants described 
above can encode an amino acid sequence which contains at least one functional or structural 
characteristic of HGPRP. 

It will be appreciated by those skilled in the art that, as a result of the degeneracy of the 
genetic code, a multitude of polynucleotide sequences encoding HGPRP, some bearing minimal 

30 similarity to the polynucleotide sequences of any known and naturally occurring gene, may be 
produced. Thus, the invention contemplates each and every possible variation of polynucleotide 
sequence that could be made by selecting combinations based on possible codon choices. These 
combinations are made in accordance with the standard triplet genetic code as applied to the 
polynucleotide sequence of naturally occurring HGPRP, and all such variations are to be 
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considered as being specifically disclosed. 

Although nucleotide sequences which encode HGPRP and its variants are preferably 
capable of hybridizing to the nucleotide sequence of the naturally occurring HGPRP under 
appropriately selected conditions of stringency, it may be advantageous to produce nucleotide 
5 sequences encoding HGPRP or its derivatives possessing a substantially different codon usage, 
e.g., inclusion of non-natural ly occurring codons. Codons may be selected to increase the rate at 
which expression of the peptide occurs in a particular prokaryotic or eukaryotic host in accordance 
with the frequency with which particular codons are utilized by the host. Other reasons for 
substantially altering the nucleotide sequence encoding HGPRP and its derivatives without 
1 0 altering the encoded amino acid sequences include the production of RNA transcripts having more 
desirable properties, such as a greater half-life, than transcripts produced from the naturally 
occurring sequence. 

The invention also encompasses production of DNA sequences which encode HGPRP and 
HGPRP derivatives, or fragments thereof, entirely by synthetic chemistry. After production, the 

1 5 synthetic sequence may be inserted into any of the many available expression vectors and cell 
systems using reagents well known in the art. Moreover, synthetic chemistry may be used to 
introduce mutations into a sequence encoding HGPRP or any fragment thereof. 

Also encompassed by the invention are polynucleotide sequences that are capable of 
hybridizing to the claimed polynucleotide sequences, and, in particular, to those shown in SEQ ID 

20 NO:7-12 and fragments thereof under various conditions of stringency. (See, e.g., Wahl, G.M. 
and S.L. Berger (1987) Methods Enzymol. 152:399-407; Kimmel, A.R. (1987) Methods Enzymol. 
1 52:507-5 1 1 .) For example, stringent salt concentration will ordinarily be less than about 750 mM 
NaCl and 75 mM trisodium citrate, preferably less than about 500 mM NaCI and 50 mM trisodium 
citrate, and most preferably less than about 250 mM NaCl and 25 mM trisodium citrate. Low 

25 stringency hybridization can be obtained in the absence of organic solvent, e.g., formamide, while 
high stringency hybridization can be obtained in the presence of at least about 35% formamide, 
and most preferably at least about 50% formamide. Stringent temperature conditions will 
ordinarily include temperatures of at least about 30°C, more preferably of at least about 37°C and 
most preferably of at least about 42°C. Varying additional parameters, such as hybridization time, 

30 the concentration of detergent, e.g., sodium dodecyl sulfate (SDS), and the inclusion or exclusion 
of carrier DNA, are well known to those skilled in the art. Various levels of stringency are 
accomplished by combining these various conditions as needed. In a preferred embodiment, 
hybridization will occur at 30°C in 750 mM NaCl, 75 mM trisodium citrate, and 1% SDS. In a 
more preferred embodiment, hybridization will occur at 37°C in 500 mM NaCl, 50 mM trisodium 
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citrate, 1% SDS, 35% formamide, and 100 ^ts/ml denatured salmon sperm DNA (ssDNA). In a 
most preferred embodiment hybridization will occur at 42°C in 250 mM NaCl, 25 mM trisodium 
citrate, 1% SDS, 50 % formamide, and 200jug/ml ssDNA. Useful variations on these conditions 
will be readily apparent to those skilled in the art. 
5 The washing steps which follow hybridization can also vary in stringency. Wash 

stringency conditions can be defined by salt concentration and by temperature. As above, wash 
stringency can be increased by decreasing salt concentration or by increasing temperature. For 
example, stringent salt concentration for the wash steps will preferably be less than about 30 mM 
NaCl and 3 mM trisodium citrate, and most preferably less than about 15 mM NaCl and 1.5 mM 

1 0 trisodium citrate. Stringent temperature conditions for the wash steps will ordinarily include 
temperature of at least about 25°C, more preferably of at least about 42°C, and most preferably of 
at least about 68°C In a preferred embodiment, wash steps will occur at 25°C in 30 mM NaCl, 3 
mM trisodium citrate, and 0.1% SDS. In a more preferred embodiment, wash steps will occur at 
42°C in 15 mM NaCL 1.5 mM trisodium citrate, and 0.1% SDS. In a most preferred embodiment, 

1 5 wash steps will occur at 68°C in 15 mM NaCl, 1.5 mM trisodium citrate, and 0.1% SDS. 
Additional variations on these conditions will be readily apparent to those skilled in the art. 

Methods for DNA sequencing are well known in the art and may be used to practice any 
of the embodiments of the invention. The methods may employ such enzymes as the Klenow 
fragment of DNA polymerase I, SEQUENASE (US Biochemical, Cleveland OH), Taq DNA 

20 polymerase (PE Biosystems), thermostable T7 polymerase (Amersham Pharmacia Biotech, 
Piscataway NJ) 5 or combinations of polymerases and proofreading exonucleases such as those 
found in the ELONGASE amplification system (Life Technologies, Gaithersburg MD). 
Preferably, sequence preparation is automated with machines such as the MICROLAB 2200 
system (Hamilton, Reno NV), DNA ENGINE thermal cycler (PTC200; MJ Research, Watertown 

25 MA) and the ABI CATALYST 800 (PE Biosystems). Sequencing is then carried out using either 
ABI PRISM 373 or 377 DNA sequencing systems (PE Biosystems) or the MEGABACE 1000 
DNA sequencing system (Amersham Pharmacia Biotech). The resulting sequences are analyzed 
using a variety of algorithms which are well known in the art. (See, e.g., Ausubel, F.M. (1997) 
Short Protocols in Molecular Biology , John Wiley & Sons, New York NY, unit 7.7; Meyers, R.A. 

30 (1995) Molecular Biology and Biotechnology , Wiley VCH, New York NY, pp. 856-853.) 

The nucleic acid sequences encoding HGPRP may be extended utilizing a partial 
nucleotide sequence and employing various PCR-based methods known in the art to detect 
upstream sequences, such as promoters and regulatory elements. For example, one method which 
may be employed, restriction-site PCR, uses universal and nested primers to amplify unknown 
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sequence from genomic DNA within a cloning vector. (See, e.g., Sarkar, G. (1993) PCR Methods 
Applic. 2:3 18-322.) Another method, inverse PCR, uses primers that extend in divergent 
directions to amplify unknown sequence from a circularized template. The template is derived 
from restriction fragments comprising a known genomic locus and surrounding sequences. (See, 
5 e.g., Triglia, T. et al. (1988) Nucleic Acids Res. 16:8186.) A third method, capture PCR, involves 
PCR amplification of DNA fragments adjacent to known sequences in human and yeast artificial 
chromosome DNA. (See, e.g., Lagerstrom, M. et al. (1991) PCR Methods Applic. 1:111-119.) In 
this method, multiple restriction enzyme digestions and ligations may be used to insert an 
engineered double-stranded sequence into a region of unknown sequence before performing PCR. 
1 0 Other methods which may be used to retrieve unknown sequences are known in the art. (See, e.g., 
Parker, J.D. et al. (1991) Nucleic Acids Res. 19:3055-3060). Additionally, one may use PCR, 
nested primers, and PRO MOTERFIN D ER libraries (Clontech, Palo Alto CA) to walk genomic 
DNA. This procedure avoids the need to screen libraries and is useful in finding intron/exon 
junctions. For all PCR-based methods, primers may be designed using commercially available 
1 5 software, such as OLIGO 4.06 primer analysis software (National Biosciences, Plymouth MN) or 
another appropriate program, to be about 22 to 30 nucleotides in length, to have a GC content of 
about 50% or more, and to anneal to the template at temperatures of about 68°C to 72°C. 

When screening for full-length cDNAs, it is preferable to use libraries that have been 
size-selected to include larger cDNAs. In addition, random-primed libraries, which often include 
20 sequences containing the 5 f regions of genes, are preferable for situations in which an ol igo d(T) 
library does not yield a full-length cDNA. Genomic libraries may be useful for extension of 
sequence into 5' non-transcribed regulatory regions. 

Capillary electrophoresis systems which are commercially available may be used to 
analyze the size or confirm the nucleotide sequence of sequencing or PCR products. In particular, 
25 capillary sequencing may employ flowable polymers for electrophoretic separation, four different 
nucleotide-specific, laser-stimulated fluorescent dyes, and a charge coupled device camera for 
detection of the emitted wavelengths. Output/light intensity may be converted to electrical signal 
using appropriate software (e.g., GENOTYPER and SEQUENCE NAVIGATOR software, PE 
Biosystems), and the entire process from loading of samples to computer analysis and electronic 
30 data display may be computer controlled. Capillary electrophoresis is especially preferable for 
sequencing small DNA fragments which may be present in limited amounts in a particular sample. 

In another embodiment of the invention, polynucleotide sequences or fragments thereof 
which encode HGPRP may be cloned in recombinant DNA molecules that direct expression of 
HGPRP, or fragments or functional equivalents thereof, in appropriate host cells. Due to the 
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inherent degeneracy of the genetic code, other DNA sequences which encode substantially the 
same or a functionally equivalent amino acid sequence may be produced and used to express 
HGPRP. 

The nucleotide sequences of the present invention can be engineered using methods 
5 generally known in the art in order to alter HGPRP-encoding sequences for a variety of purposes 
including, but not limited to, modification of the cloning, processing, and/or expression of the 
gene product. DNA shuffling by random fragmentation and PCR reassembly of gene fragments 
and synthetic oligonucleotides may be used to engineer the nucleotide sequences. For example, 
oligonucleotide-mediated site-directed mutagenesis may be used to introduce mutations that create 
1 0 new restriction sites, alter glycosylation patterns, change codon preference, produce splice 
variants, and so forth. 

In another embodiment, sequences encoding HGPRP may be synthesized, in whole or in 
part, using chemical methods well known in the art. (See, e.g., Caruthers, M.H. et al. (1980) 
Nucleic Acids Symp. Ser. 7:215-223; Horn, T. et al. (1980) Nucleic Acids Symp. Ser. 7:225-232.) 

1 5 Alternatively, HGPRP itself or a fragment thereof may be synthesized using chemical methods. 
For example, peptide synthesis can be performed using various solid-phase techniques. (See, e.g., 
Roberge, J.Y. et al. (1995) Science 269:202-204.) Automated synthesis may be achieved using 
the ABI 43 1 A peptide synthesizer (PE Biosystems). Additionally, the amino acid sequence of 
HGPRP, or any part thereof, may be altered during direct synthesis and/or combined with 

20 sequences from other proteins, or any part thereof, to produce a variant polypeptide. 

The peptide may be substantially purified by preparative high performance liquid 
chromatography. (See, e.g, Chiez, R.M. and F.Z. Regnier ( 1 990) Methods Enzymol. 182:392- 
421 .) The composition of the synthetic peptides may be confirmed by amino acid analysis or by 
sequencing. (See, e.g., Creighton, T. fiQSM) P^ins Structures and Molecular Properties, WH 

25 Freeman, New York NY.) 

In order to express a biologically active HGPRP, the nucleotide sequences encoding 
HGPRP or derivatives thereof may be inserted into an appropriate expression vector, i.e., a vector 
which contains the necessary elements for transcriptional and translation^ control of the inserted 
coding sequence in a suitable host. These elements include regulatory sequences, such as 

30 enhancers, constitutive and inducible promoters, and 5' and 3' untranslated regions in the vector 
and in polynucleotide sequences encoding HGPRP. Such elements may vary in their strength and 
specificity. Specific initiation signals may also be used to achieve more efficient translation of 
sequences encoding HGPRP. Such signals include the ATG initiation codon and adjacent 
sequences, e.g. the Kozak sequence. In cases where sequences encoding HGPRP and its initiation 
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codon and upstream regulatory sequences are inserted into the appropriate expression vector, no 
additional transcriptional or translational control signals may be needed. However, in cases where 
only coding sequence, or a fragment thereof, is inserted, exogenous translational control signals 
including an in-frame ATG initiation codon should be provided by the vector. Exogenous 
5 translational elements and initiation codons may be of various origins, both natural and synthetic. 
The efficiency of expression may be enhanced by the inclusion of enhancers appropriate for the 
particular host cell system used. (See, e.g., Scharf, D. et al. (1994) Results Probl. Cell Differ. 
20:125-162.) 

Methods which are well known to those skilled in the art may be used to construct 
10 expression vectors containing sequences encoding HGPRP and appropriate transcriptional and 
translational control elements. These methods include in vitro recombinant DNA techniques, 
synthetic techniques, and in vivo genetic recombination. (See, e.g., Sambrook, J. et al. (1989) 
Molecular Cloning. A Laboratory Manual . Cold Spring Harbor Press, Plainview NY, ch. 4, 8, and 
16-17: Ausubel, F.M. et al. H995^ Current Protocols in Molecula r Biology. John Wiley & Sons, 
15 New York NY, ch. 9, 13, and 16.) 

A variety of expression vector/host systems may be utilized to contain and express 
sequences encoding HGPRP. These include, but are not limited to, microorganisms such as 
bacteria transformed with recombinant bacteriophage, plasmid, or cosmid DNA expression 
vectors; yeast transformed with yeast expression vectors; insect cell systems infected with viral 
20 expression vectors (e.g., baculovirus); plant cell systems transformed with viral expression vectors 
(e.g., cauliflower mosaic virus, CaMV, or tobacco mosaic virus,TMV) or with bacterial expression 
vectors (e.g., Ti or pBR322 plasmids); or animal cell systems. The invention is not limited by the 
host cell employed. 

In bacterial systems, a number of cloning and expression vectors may be selected 
25 depending upon the use intended for polynucleotide sequences encoding HGPRP. For example, 
routine cloning, subcloning, and propagation of polynucleotide sequences encoding HGPRP can 
be achieved using a multifunctional R^cdi vector such as PBLUESCRIPT (Stratagene, La Jolla 
CA) or PSPORT1 plasmid (Life Technologies). Ligation of sequences encoding HGPRP into the 
vector's multiple cloning site disrupts the lacZ gene, allowing a colorimetric screening procedure 
30 for identification of transformed bacteria containing recombinant molecules. In addition, these 
vectors may be useful for in vitro transcription, dideoxy sequencing, single strand rescue with 
helper phage, and creation of nested deletions in the cloned sequence. (See, e.g., Van Heeke, G. 
and S.M. Schuster (1989) J. Biol. Chem. 264:5503-5509.) When large quantities of HGPRP are 
needed, e.g. for the production of antibodies, vectors which direct high level expression of HGPRP 
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may be used. For example, vectors containing the strong, inducible T5 or T7 bacteriophage 
promoter may be used. 

Yeast expression systems may be used for production of HGPRP. A number of vectors 
containing constitutive or inducible promoters, such as alpha factor, alcohol oxidase, and PGH, 
5 may be used in the yeast Saccharomvces cerevisiae or Pichia pastoris . In addition, such vectors 
direct either the secretion or intracellular retention of expressed proteins and enable integration of 
foreign sequences into the host genome for stable propagation. (See, e.g., Ausubel, 1995. supra ; 
Bitter, G.A. et al. (1987) Methods Enzymol. 153:516-544; and Scorer, C.A. et al. (1994) 
Bio/Technology 12:181-184.) 

1 0 Plant systems may also be used for expression of HGPRP. Transcription of sequences 

encoding HGPRP may be driven viral promoters, e.g., the 35S and 19S promoters of CaMV used 
alone or in combination with the omega leader sequence from TMV (Takamatsu, N. (1987) 
EMBO J. 6:307-3 11). Alternatively, plant promoters such as the small subunit of RUBISCO or 
heat shock promoters may be used. (See, e.g., Coruzzi, G. et al. (1984) EMBO J. 3:1671-1680; 

15 Broglie, R. et al. (1984) Science 224:838-843; and Winter, J. et al. (1991) Results Probl. Cell 
Differ. 17:85-105.) These constructs can be introduced into plant cells by direct DNA 
transformation or pathogen-mediated transfection. (See, e.g.. The McGraw Hill Yearbook of 
Science and Technology (1992) McGraw Hill, New York NY, pp. 191-196.) 

In mammalian cells, a number of viral-based expression systems may be utilized. In cases 

20 where an adenovirus is used as an expression vector, sequences encoding HGPRP may be ligated 
into an adenovirus transcription/translation complex consisting of the late promoter and tripartite 
leader sequence. Insertion in a non-essential El or E3 region of the viral genome may be used to 
obtain infective virus which expresses HGPRP in host cells. (See, e.g., Logan, J. and T. Shenk 
(1984) Proc. Natl. Acad. Sci. USA 81:3655-3659.) In addition, transcription enhancers, such as 

25 the Rous sarcoma virus (RSV) enhancer, may be used to increase expression in mammalian host 
cells. SV40 or EBV-based vectors may also be used for high-level protein expression. 

Human artificial chromosomes (HACs) may also be employed to deliver larger fragments 
of DNA than can be contained in and expressed .from a plasmid. HACs of about 6 kb to 10 Mb 
are constructed and delivered via conventional delivery methods (liposomes, polycationic amino 

30 polymers, or vesicles) for therapeutic purposes. (See, e.g., Harrington, J.J. et al. (1997) Nat. 
Genet. 15:345-355.) 

For long term production of recombinant proteins in mammalian systems, stable 
expression of HGPRP in cell lines is preferred. For example, sequences encoding HGPRP can be 
transformed into cell lines using expression vectors which may contain viral origins of replication 
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and/or endogenous expression elements and a selectable marker gene on the same or on a separate 
vector. Following the introduction of the vector, cells may be allowed to grow for about 1 to 2 
days in enriched media before being switched to selective media. The purpose of the selectable 
marker is to confer resistance to a selective agent, and its presence allows growth and recovery of 
5 cells which successfully express the introduced sequences. Resistant clones of stably transformed 
cells may be propagated using tissue culture techniques appropriate to the cell type. 

Any number of selection systems may be used to recover transformed cell lines. These 
include, but are not limited to, the herpes simplex virus thymidine kinase and adenine 
phosphoribosyltransferase genes, for use in tk oxapr cells, respectively. (See, e.g., Wigler, M. et 

10 al. (1977) Cell 1 1:223-232; Lowy, I. et al. (1980) Cell 22:817-823.) Also, antimetabolite, 

antibiotic, or herbicide resistance can be used as the basis for selection. For example, dhfr confers 
resistance to methotrexate; neo confers resistance to the aminoglycosides, neomycin and G-418; 
and als and pat confer resistance to chlorsulfuron and phosphinotricin acetyltransferase, 
respectively. (See, e.g., Wigler, M. et al. (1980) Proc. Natl. Acad. Sci. USA 77:3567-3570; 

15 Colbere-Garapin,F.etal.(1981)J.Mol.Biol. 150:1-14.) Additional selectable genes have been 
described, e.g., trpB and hisD, which alter cellular requirements for metabolites. (See, e.g., 
Hartman, S.C. and R.C. Mulligan (1988) Proc. Natl. Acad. Sci. USA 85:8047-8051.) Visible 
markers, e.g., anthocyanins, green fluorescent proteins (GFP; Clontech), B glucuronidase and its 
substrate B-glucuronide, or luciferase and its substrate luciferin may be used. These markers can 

20 be used not only to identify transformants, but also to quantify the amount of transient or stable 
protein expression attributable to a specific vector system. (See, e.g., Rhodes, C.A. (1995) 
Methods Mol. Biol. 55:121-131.) 

Although the presence/absence of marker gene expression suggests that the gene of 
interest is also present, the presence and expression of the gene may need to be confirmed. For 

25 example, if the sequence encoding HGPRP is inserted within a marker gene sequence, transformed 
cells containing sequences encoding HGPRP can be identified by the absence of marker gene 
function. Alternatively, a marker gene can be placed in tandem with a sequence encoding HGPRP 
under the control of a single promoter. Expression of the marker gene in response to induction or 
selection usually indicates expression of the tandem gene as well. 

30 In general, host cells that contain the nucleic acid sequence encoding HGPRP and that 

express HGPRP may be identified by a variety of procedures known to those of skill in the art. 
These procedures include, but are not limited to, DNA-DNA or DNA-RNA hybridizations, PCR 
amplification, and protein bioassay or immunoassay techniques which include membrane, 
solution, or chip based technologies for the detection and/or quantification of nucleic acid or 
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protein sequences. 

Immunological methods for detecting and measuring the expression of HGPRP using 
either specific polyclonal or monoclonal antibodies are known in the art. Examples of such 
techniques include enzyme-linked immunosorbent assays (ELISAs), radioimmunoassays (RIAs), 
5 and fluorescence activated cell sorting (FACS). A two-site, monoclonal-based immunoassay 
utilizing monoclonal antibodies reactive to two non- interfering epitopes on HGPRP is preferred, 
but a competitive binding assay may be employed. These and other assays are well known in the 
art. (See, e.g., Hampton, R. et al. (1990) Serological M ethods, a Laboratory Manual. APS Press, 
St. Paul MN, Sect. IV; Coligan, J.E. et al. "0071 Current Protocols in Immunology, Greene Pub. 

1 0 Associates and Wiley-Interscience, New York NY; and Pound, J.D. ( 1 998) Immunochemical 
Protocols , Humana Press, Totowa NJ.) 

A wide variety of labels and conjugation techniques are known by those skilled in the art 
and may be used in various nucleic acid and amino acid assays. Means for producing labeled 
hybridization or PCR probes for detecting sequences related to polynucleotides encoding HGPRP 

1 5 include oligolabeling, nick translation, end-labeling, or PCR amplification using a labeled 
nucleotide. Alternatively, the sequences encoding HGPRP, or any fragments thereof, may be 
cloned into a vector for the production of an mRNA probe. Such vectors are known in the art, are 
commercially available, and may be used to synthesize RNA probes in vitro by addition of an 
appropriate RNA polymerase such as T7, T3, or.SP6 and labeled nucleotides. These procedures 

20 may be conducted using a variety of commercially available kits, such as those provided by 
Amersham Pharmacia Biotech, Promega (Madison WI), and US Biochemical. Suitable reporter 
molecules or labels which may be used for ease of detection include radionuclides, enzymes, 
fluorescent, chemiluminescent, or chromogenic agents, as well as substrates, cofactors, inhibitors, 
magnetic particles, and the like. 

25 Host cells transformed with nucleotide sequences encoding HGPRP may be cultured 

under conditions suitable for the expression and recovery of the protein from cell culture. The 
protein produced by a transformed cell may be secreted or retained intracellularly depending on 
the sequence and/or the vector used. As will be understood by those of skill in the art, expression 
vectors containing polynucleotides which encode HGPRP may be designed to contain signal 

30 sequences which direct secretion of HGPRP through a prokaryotic or eukaryotic cell membrane. 

In addition, a host cell strain may be chosen for its ability to modulate expression of the 
inserted sequences or to process the expressed protein in the desired fashion. Such modifications 
of the polypeptide include, but are not limited to, acetylation, carboxylation, glycosylation, 
phosphorylation, lipidation, and acylation. Post-translational processing which cleaves a :: prepro !! 
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form of the protein may also be used to specify protein targeting, folding, and/or activity. 
Different host cells which have specific cellular machinery and characteristic mechanisms for 
post-translational activities (e.g., CHCX HeLa. MDCK, HEK293, and W138), are available from 
the American Type Culture Collection (ATCC Manassas VA) and may be chosen to ensure the 
5 correct modification and processing of the foreign protein. 

In another embodiment of the invention, natural, modified, or recombinant nucleic acid 
sequences encoding HGPRP may be ligated to a heterologous sequence resulting in translation of 
a fusion protein in any of the aforementioned host systems. For example, a chimeric HGPRP 
protein containing a heterologous moiety that can be recognized by a commercially available 

1 0 antibody may facilitate the screening of peptide libraries for inhibitors of HGPRP activity. 

Heterologous protein and peptide moieties may also facilitate purification of fusion proteins using 
commercially available affinity matrices. Such moieties include, but are not limited to, glutathione 
S-transferase (GST), maltose binding protein (MBP), thioredoxin (Trx), calmodulin binding 
peptide (CBP), 6-His, FLAG, c-myc, and hemagglutinin (HA). GST, MBP, Trx, CBP, and 6-His 

1 5 enable purification of their cognate fusion proteins on immobilized glutathione, maltose, 
phenylarsine oxide, calmodulin, and metal-chelate resins, respectively. FLAG, c-myc, and 
hemagglutinin (HA) enable immunoaffinity purification of fusion proteins using commercially 
available monoclonal and polyclonal antibodies that specifically recognize these epitope tags. A 
fusion protein may also be engineered to contain a proteolytic cleavage site located between the 

20 HGPRP encoding sequence and the heterologous protein sequence, so that HGPRP may be 
cleaved away from the heterologous moiety following purification. Methods for fusion protein 
expression and purification are discussed in Ausubel (1995. supra , ch 10). A variety of 
commercially available kits may also be used to facilitate expression and purification of fusion 
proteins. 

25 In a further embodiment of the invention, synthesis of radiolabeled HGPRP may be 

achieved in vitro using the TNT rabbit reticulocyte lysate or wheat germ extract systems 
(Promega). These systems couple transcription and translation of protein-coding sequences 
operably associated with the T7, T3, or SP6 promoters. Translation takes place in the presence of 
a radiolabeled amino acid precursor, preferably 35 S-methionine. 

30 Fragments of HGPRP may be produced not only by recombinant production, but also by 

direct peptide synthesis using solid-phase techniques. (See, e.g., Creighton.iugra, pp. 55-60.) 
Protein synthesis may be performed by manual techniques or by automation. Automated synthesis 
may be achieved, for example, using the ABI 43 1 A peptide synthesizer (PE Biosystems). Various 
fragments of HGPRP may be synthesized separately and then combined to produce the full length 
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molecule. 

THERAPEUTICS 

Chemical and structural similarity, e.g., in the context of sequences and motifs, exists 
between regions of HGPRP and GPCR proteins. In addition, the expression of HGPRP is closely 
5 associated with cell proliferative and immune disorders, and with neurological tissues. Therefore, 
HGPRP appears to play a role in cell proliferative, neurological, and immune disorders. In the 
treatment of disorders associated with increased HGPRP expression or activity, it is desirable to 
decrease the expression or activity of HGPRP. In the treatment of disorders associated with 
decreased HGPRP expression or activity, it is desirable to increase the expression or activity of 
10 HGPRP. 

Therefore, in one embodiment, HGPRP or a fragment or derivative thereof may be 
administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HGPRP. Examples of such disorders include, but are not limited to, a cell proliferative 
disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, cirrhosis, hepatitis, 

1 5 mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal hemoglobinuria, 
polycythemia vera, psoriasis, primary thrombocythemia; cancers including adenocarcinoma, 
leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in particular, cancers 
of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall bladder, ganglia, 
gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, parathyroid, penis, 

20 prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an immune disorder such 
as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult respiratory distress 
syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, atherosclerosis, 
autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, contact 
dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes mellitus, emphysema, 

25 episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema nodosum, 
atrophic gastritis, glomerulonephritis, Goodpasture's syndrome, gout, Graves' disease, 
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 

30 Sjogren's syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 
thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease. Pick's disease, 
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Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 
amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias; multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
5 suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease; prion diseases including kuru, Creutzfeldt-Jakob disease, and Gerstmann- 
Straussler-Scheinker syndrome; fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the 
1 0 central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system 
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 
neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; 
inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic paralysis; 
mental disorders including mood, anxiety, and schizophrenic disorders; akathesia, amnesia, 
1 5 catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder. 

In another embodiment, a vector capable of expressing HGPRP or a fragment or 
derivative thereof may be administered to a subject to treat or prevent a disorder associated with 
decreased expression or activity of HGPRP including, but not limited to, those described above. 
20 In a further embodiment, a pharmaceutical composition comprising a substantially 

purified HGPRP in conjunction with a suitable pharmaceutical carrier may be administered to a 
subject to treat or prevent a disorder associated with decreased expression or activity of HGPRP 
including, but not limited to, those provided above. 

In still another embodiment, an agonist which modulates the activity of HGPRP may be 
25 administered to a subject to treat or prevent a disorder associated with decreased expression or 
activity of HGPRP including, but not limited to, those listed above. 

In a further embodiment, an antagonist of HGPRP may be administered to a subject to 
treat or prevent a disorder associated increased expression or activity of HGPRP. Examples of 
such disorders include, but are not limited to, those described above. In one aspect, an antibody 
30 which specifically binds HGPRP may be used directly as an antagonist or indirectly as a targeting 
or delivery mechanism for bringing a pharmaceutical agent to cells or tissue which express 
HGPRP. 

In an additional embodiment, a vector expressing the complement of the polynucleotide 
encoding HGPRP may be administered to a subject to treat or prevent a disorder associated 
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increased expression or activity of HGPRP including, but not limited to, those described above. 

In other embodiments, any of the proteins, antagonists, antibodies, agonists, 
complementary sequences, or vectors of the invention may be administered in combination with 
other appropriate therapeutic agents. Selection of the appropriate agents for use in combination 
5 therapy may be made by one of ordinary skill in the art, according to conventional pharmaceutical 
principles. The combination of therapeutic agents may act synergisticaily to effect the treatment 
or prevention of the various disorders described above. Using this approach, one may be able to 
achieve therapeutic efficacy with lower dosages of each agent, thus reducing the potential for 
adverse side effects. 

1 0 An antagonist of HGPRP may be produced using methods which are generally known in 

the art. In particular, purified HGPRP may be used to produce antibodies or to screen libraries of 
pharmaceutical agents to identify those which specifically bind HGPRP. Antibodies to HGPRP 
may also be generated using methods that are well known in the art. Such antibodies may include, 
but are not limited to, polyclonal, monoclonal, chimeric, and single chain antibodies. Fab 

15 fragments, and fragments produced by a Fab expression library. Neutralizing antibodies (e.g., 
those which inhibit dimer formation) are especially preferred for therapeutic use. 

For the production of antibodies, various hosts including goats, rabbits, rats, mice, 
humans, and others may be immunized by injection with HGPRP or with any fragment or 
oligopeptide thereof which has immunogenic properties. Depending on the host species, various 

20 adjuvants may be used to increase immunological response. Such adjuvants include, but are not 
limited to, Freund's, mineral gels such as aluminum hydroxide, and surface active substances such 
as lysolecithin, piuronic polyols, polyanions, peptides, oil emulsions, KLH, and dinitrophenol. 
Among adjuvants used in humans. BCG (bacilli Calmette-Guerin) and Corvnebacterium parvum 
are especially preferable. 

25 It is preferred that the oligopeptides, peptides, or fragments used to induce antibodies to 

HGPRP have an amino acid sequence consisting of at least about 5 amino acids, and. more 
preferably, of at least about 10 amino acids. It is also preferable that these oligopeptides, peptides, 
or fragments are identical to a portion of the amino acid sequence of the natural protein and 
contain the entire amino acid sequence of a small, naturally occurring molecule. Short stretches of 

30 HGPRP amino acids may be fused with those of another protein, such as KLH, and antibodies to 
the chimeric molecule may be produced. 

Monoclonal antibodies to HGPRP may be prepared using any technique which provides 
for the production of antibody molecules by continuous cell lines in culture. These include, but 
are not limited to, the hybridoma technique, the human B-cell hybridoma technique, and the EBV- 
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hybridoma technique. (See, e.g., Kohler, G. et al. (1975) Nature 256:495-497; Kozbor, D. et ai. 
(1985) J. Immunol. Methods 81:31-42; Cote, R.J. et al. (1983) Proc. Natl. Acad. Sci. USA 
80:2026-2030; and Cole, S.P. et ai. (1984) Mol. Cell Biol. 62:109-120.) 

In addition, techniques developed for the production of "chimeric antibodies/' such as the 
5 splicing of mouse antibody genes to human antibody genes to obtain a molecule with appropriate 
antigen specificity and biological activity, can be used. (See, e.g., Morrison, S.L. et al. (1984) 
Proc. Natl. Acad. Sci. USA 81:6851-6855; Neuberger, M.S. et al. (1984) Nature 3 12:604-608; and 
Takeda, S, et al. (1985) Nature 3 14:452-454.) Alternatively, techniques described for the 
production of single chain antibodies may be adapted, using methods known in the art, to produce 
1 0 HGPRP-specific single chain antibodies. Antibodies with related specificity, but of distinct 
idiotypic composition, may be generated by chain shuffling from random combinatorial 
immunoglobulin libraries. (See, e.g., Burton, D.R. (1991) Proc. Natl. Acad. Sci. USA 88:10134- 
10137.) 

Antibodies may also be produced by inducing in vivo production in the lymphocyte 

1 5 population or by screening immunoglobulin libraries or panels of highly specific binding reagents 
as disclosed in the literature. (See, e.g., Orlandi, R. et al. (1989) Proc. Natl. Acad. Sci. USA 
86:3833-3837; Winter, G. et al. (1991) Nature 349:293-299.) 

Antibody fragments which contain specific binding sites for HGPRP may also be 
generated. For example, such fragments include, but are not limited to, F(ab')2 fragments 

20 produced by pepsin digestion of the antibody molecule and Fab fragments generated by reducing 
the disulfide bridges of the F(ab*)2 fragments. Alternatively, Fab expression libraries may be 
constructed to allow rapid and easy identification of monoclonal Fab fragments with the desired 
specificity. (See, e.g., Huse, W.D. et al. (1989) Science 246:1275-1281.) 

Various immunoassays may be used for screening to identify antibodies having the 

25 desired specificity. Numerous protocols for competitive binding or immunoradiometric assays 
using either polyclonal or monoclonal antibodies with established specificities are well known in 
the art. Such immunoassays typically involve the measurement of complex formation between 
HGPRP and its specific antibody. A two-site, monoclonal-based immunoassay utilizing 
monoclonal antibodies reactive to two non-interfering HGPRP epitopes is preferred, but a 

30 competitive binding assay may also be employed (Pound, supra). 

Various methods such as Scatchard analysis in conjunction with radioimmunoassay 
techniques may be used to assess the affinity of antibodies for HGPRP. Affinity is expressed as 
an association constant, K a , which is defined as the molar concentration of HGPRP-antibody 
complex divided by the molar concentrations of free antigen and free antibody under equilibrium 

35 conditions. The K a determined for a preparation of polyclonal antibodies, which are 
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heterogeneous in their affinities for multiple HGPRP epitopes, represents the average affinity, or 
avidity, of the antibodies for HGPRP. The K a determined for a preparation of monoclonal 
antibodies, which are monospecific for a particular HGPRP epitope, represents a true measure of 
affinity. High-affinity antibody preparations with K a ranging from about 10 9 to 10 12 L/mole are 
5 preferred for use in immunoassays in which the HGPRP-antibody complex must withstand 

rigorous manipulations. Low-affinity antibody preparations with ranging from about 10 6 to 10 7 
L/mole are preferred for use in immunopuriflcation and similar procedures which ultimately 
require dissociation of HGPRP, preferably in active form, from the antibody (Catty, D. (1988) 
Antibodies. Volume 1: A Practical Approach , IRL Press, Washington, DC; Liddell, J.E. and Cryer, 

10 A. ( 1 99 1 ) A Practical Guide to Monoclonal Antibodies , John Wiley & Sons, New York NY). 

The titer and avidity of polyclonal antibody preparations may be further evaluated to 
determine the quality and suitability of such preparations for certain downstream applications. For 
example, a polyclonal antibody preparation containing at least 1-2 mg specific antibody/ml, 
preferably 5-10 mg specific antibody/ml, is preferred for use in procedures requiring precipitation 

1 5 of HGPRP-antibody complexes. Procedures for evaluating antibody specificity, titer, and avidity, 
and guidelines for antibody quality and usage in various applications, are generally available. 
(See, e.g., Catty, supra , and Coligan et al. supra.) 

In another embodiment of the invention, the polynucleotides encoding HGPRP, or any 
fragment or complement thereof, may be used for therapeutic purposes. In one aspect, the 

20 complement of the polynucleotide encoding HGPRP may be used in situations in which it would 
be desirable to block the transcription of the mRNA. In particular, cells may be transformed with 
sequences complementary to polynucleotides encoding HGPRP. Thus, complementary molecules 
or fragments may be used to modulate HGPRP activity, or to achieve regulation of gene function. 
Such technology is now well known in the art, and sense or antisense oligonucleotides or larger 

25 fragments can be designed from various locations along the coding or control regions of sequences 
encoding HGPRP. 

Expression vectors derived from retroviruses, adenoviruses, or herpes or vaccinia viruses, 
or from various bacterial plasmids, may be used for delivery of nucleotide sequences to the 
targeted organ, tissue, or cell population. Methods which are well known to those skilled in the art 
30 can be used to construct vectors to express nucleic acid sequences complementary to the 
polynucleotides encoding HGPRP. (See, e.g., Sambrook, su^ra; Ausubel, 1995, su^ra.) 

Genes encoding HGPRP can be turned off by transforming a cell or tissue with expression 
vectors which express high levels of a polynucleotide, or fragment thereof, encoding HGPRP. 
Such constructs may be used to introduce untranslatable sense or antisense sequences into a cell. 
35 Even in the absence of integration into the DNA, such vectors may continue to transcribe RNA 
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molecules until they are disabled by endogenous nucleases. Transient expression may last for a 
month or more with a non-replicating vector, and may last even longer if appropriate replication 
elements are part of the vector system. 

As mentioned above, modifications of gene expression can be obtained by designing 
5 complementary sequences or antisense molecules (DNA, RNA, or PNA) to the control, 5', or 
regulatory regions of the gene encoding HGPRP. Oligonucleotides derived from the transcription 
initiation site, e.g., between about positions -10 and +10 from the start site, are preferred. 
Similarly, inhibition can be achieved using triple helix base-pairing methodology. Triple helix 
pairing is useful because it causes inhibition of the ability of the double helix to open sufficiently 
1 0 for the binding of polymerases, transcription factors, or regulatory molecules. Recent therapeutic 
advances using triplex DNA have been described in the literature. (See, e.g., Gee, J.E. et ah 
(1994) in Huber, B.E. and B.I. Carr, Molecular and Immunologic Approaches . Futura Publishing, 
Mt. KiscoNY, pp. 163-177.) A complementary sequence or antisense molecule may also be 
designed to block translation of mRNA by preventing the transcript from binding to ribosomes. 
1 5 Ribozymes, enzymatic RNA molecules, may also be used to catalyze the specific cleavage 

of RNA. The mechanism of ribozyme action involves sequence-specific hybridization of the 
ribozyme molecule to complementary target RNA, followed by endonucleolytic cleavage. For 
example, engineered hammerhead motif ribozyme molecules may specifically and efficiently 
catalyze endonucleolytic cleavage of sequences encoding HGPRP. 
20 Specific ribozyme cleavage sites within any potential RNA target are initially identified by 

scanning the target molecule for ribozyme cleavage sites, including the following sequences: 
GUA, GUU, and GUC. Once identified, short RNA sequences of between 15 and 20 
ribonucleotides, corresponding to the region of the target gene containing the cleavage site, may 
be evaluated for secondary structural features which may render the oligonucleotide inoperable. 
25 The suitability of candidate targets may also be evaluated by testing accessibility to hybridization 
with complementary oligonucleotides using ribonuclease protection assays. 

Complementary ribonucleic acid molecules and ribozymes of the invention may be 
prepared by any method known in the art for the synthesis of nucleic acid molecules. These 
include techniques for chemically synthesizing oligonucleotides such as solid phase 
30 phosphoramidite chemical synthesis. Alternatively, RNA molecules may be generated by in vitro 
and in vivo transcription of DNA sequences encoding HGPRP. Such DNA sequences may be 
incorporated into a wide variety of vectors with suitable RNA polymerase promoters such as T7 or 
SP6. Alternatively, these cDNA constructs that synthesize complementary RNA, constitutively or 
inducibly, can be introduced into cell lines, cells, or tissues. 
35 RNA molecules may be modified to increase intracellular stability and half-life. Possible 
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modifications include, but are not limited to, the addition of flanking sequences at the 5* and/or 3' 
ends of the molecule, or the use of phosphorothioate or 2' O-methyl rather than phosphodiesterase 
linkages within the backbone of the molecule. This concept is inherent in the production of PNAs 
and can be extended in all of these molecules by the inclusion of nontraditional bases such as 
5 inosine, queosine, and wybutosine, as well as acetyl-, methyl-, thio-, and similarly modified forms 
of adenine, cytidine, guanine, thymine, and uridine which are not as easily recognized by 
endogenous endonucleases. 

Many methods for introducing vectors into cells or tissues are available and equally 
suitable for use in vivo , in vitro , and ex vivo . For ex vivo therapy, vectors may be introduced into 

10 stem cells taken from the patient and clonally propagated for autologous transplant back into that 
same patient. Delivery by transfection, by liposome injections, or by polycationic amino polymers 
may be achieved using methods which are well known in the art. (See, e.g., Goldman, C.K. et al. 
(1 997) Nat. Biotechnol. 15:462-466,) 

Any of the therapeutic methods described above may be applied to any subject in need of 

15 such therapy, including, for example, mammals such as dogs, cats, cows, horses, rabbits, 
monkeys, and most preferably, humans. 

An additional embodiment of the invention relates to the administration of a 
pharmaceutical or sterile composition, in conjunction with a pharmaceutically acceptable carrier, 
for any of the therapeutic effects discussed above. Such pharmaceutical compositions may consist 

20 of HGPRP, antibodies to HGPRP, and mimetics, agonists, antagonists, or inhibitors of HGPRP. 
The compositions may be administered alone or in combination with at least one other agent, such 
as a stabilizing compound, which may be administered in any sterile, biocompatible 
pharmaceutical carrier including, but not limited to, saline, buffered saline, dextrose, and water. 
The compositions may be administered to a patient alone, or in combination with other agents, 

25 drugs, or hormones. 

The pharmaceutical compositions utilized in this invention may be administered by any 
number of routes including, but not limited to, oral, intravenous, intramuscular, intra-arterial, 
intramedullary, intrathecal, intraventricular, transdermal, subcutaneous, intraperitoneal, intranasal, 
enteral, topical, sublingual, or rectal means. 

30 In addition to the active ingredients, these pharmaceutical compositions may contain 

suitable pharmaceutically-acceptable carriers comprising excipients and auxiliaries which 
facilitate processing of the active compounds into preparations which can be used 
pharmaceutically. Further details on techniques for formulation and administration may be found 
in the latest edition of Remington's Pharmaceutical Sciences (Maack Publishing, Easton PA). 

35 Pharmaceutical compositions for oral administration can be formulated using 
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pharmaceutical^ acceptable carriers well known in the art in dosages suitable for oral 
administration. Such carriers enable the pharmaceutical compositions to be formulated as tablets, 
pills, dragees, capsules, liquids, gels, syrups, slurries, suspensions, and the like, for ingestion by 
the patient. 

5 Pharmaceutical preparations for oral use can be obtained through combining active 

compounds with solid excipient and processing the resultant mixture of granules (optionally, after 
grinding) to obtain tablets or dragee cores. Suitable auxiliaries can be added, if desired. Suitable 
excipients include carbohydrate or protein fillers, such as sugars, including lactose, sucrose, 
mannitol, and sorbitol; starch from corn, wheat, rice, potato, or other plants; cellulose, such as 

10 methyl cellulose, hydroxypropylmethyl-cellulose, or sodium carboxymethyicellulose; gums, 
including arabic and tragacanth; and proteins, such as gelatin and collagen. If desired, 
disintegrating or solubilizing agents may be added, such as the cross-linked polyvinyl pyrrolidone, 
agar, and alginic acid or a salt thereof, such as sodium alginate. 

Dragee cores may be used in conjunction with suitable coatings, such as concentrated 

1 5 sugar solutions, which may also contain gum arabic, talc, polyvinylpyrrolidone, carbopol gel, 
polyethylene glycol, and/or titanium dioxide, lacquer solutions, and suitable organic solvents or 
solvent mixtures. Dyestuffs or pigments may be added to the tablets or dragee coatings for 
product identification or to characterize the quantity of active compound, i.e., dosage. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

20 gelatin, as well as soft, sealed capsules made of gelatin and a coating, such as glycerol or sorbitol. 
Push-fit capsules can contain active ingredients mixed with fillers or binders, such as lactose or 
starches, lubricants, such as talc or magnesium stearate, and, optionally, stabilizers. In soft 
capsules, the active compounds may be dissolved or suspended in suitable liquids, such as fatty 
oils, liquid, or liquid polyethylene glycol with or without stabilizers. 

25 Pharmaceutical formulations suitable for parenteral administration may be formulated in 

aqueous solutions, preferably in physiologically compatible buffers such as Hanks* solution, 
Ringer's solution, or physiologically buffered saline. Aqueous injection suspensions may contain 
substances which increase the viscosity of the suspension, such as sodium carboxymethyl 
cellulose, sorbitol, or dextran. Additionally, suspensions of the active compounds may be 

30 prepared as appropriate oily injection suspensions. Suitable lipophilic solvents or vehicles include 
fatty oils, such as sesame oil, or synthetic fatty acid esters, such as ethyl oleate, triglycerides, or 
liposomes. Non-lipid polycationic amino polymers may also be used for delivery. Optionally, the 
suspension may also contain suitable stabilizers or agents to increase the solubility of the 
compounds and allow for the preparation of highly concentrated solutions. 

35 For topical or nasal administration, penetrants appropriate to the particular barrier to be 
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permeated are used in the formulation. Such penetrants are generally known in the art. 

The pharmaceutical compositions of the present invention may be manufactured in a 
manner that is known in the art, e.g., by means of conventional mixing, dissolving, granulating, 
dragee-making, levigating, emulsifying, encapsulating, entrapping, or lyophilizing processes. 
5 The pharmaceutical composition may be provided as a salt and can be formed with many 

acids, including but not limited to, hydrochloric, sulfuric, acetic, lactic, tartaric, malic, and 
succinic acid. Salts tend to be more soluble in aqueous or other protonic solvents than are the 
corresponding free base forms. In other cases, the preferred preparation may be a lyophilized 
powder which may contain any or all of the following: 1 mM to 50 mM histidine, 0.1% to 2% 
1 0 sucrose, and 2% to 7% mannitol, at a pH range of 4.5 to 5.5, that is combined with buffer prior to 
use. 

After pharmaceutical compositions have been prepared, they can be placed in an 
appropriate container and labeled for treatment of an indicated condition. For administration of 
HGPRP, such labeling would include amount, frequency, and method of administration. 
1 5 Pharmaceutical compositions suitable for use in the invention include compositions 

wherein the active ingredients are contained in an effective amount to achieve the intended 
purpose. The determination of an effective dose is well within the capability of those skilled in the 
art. 

For any compound, the therapeutically effective dose can be estimated initially either in 

20 cell culture assays, e.g., of neoplastic cells or in animal models such as mice, rats, rabbits, dogs, or 
pigs. An animal model may also be used to determine the appropriate concentration range and 
route of administration. Such information can then be used to determine useful doses and routes 
for administration in humans. 

A therapeutically effective dose refers to that amount of active ingredient for example 

25 HGPRP or fragments thereof, antibodies of HGPRP, and agonists, antagonists or inhibitors of 
HGPRP, which ameliorates the symptoms or condition. Therapeutic efficacy and toxicity may be 
determined by standard pharmaceutical procedures in cell cultures or with experimental animals, 
such as by calculating the ED 50 (the dose therapeutically effective in 50% of the population) or 
LD 50 (the dose lethal to 50% of the population) statistics. The dose ratio of toxic to therapeutic 

30 effects is the therapeutic index, and it can be expressed as the LD SQ /ED S0 ratio. Pharmaceutical 
compositions which exhibit large therapeutic indices are preferred. The data obtained from cell 
culture assays and animal studies are used to formulate a range of dosage for human use. The 
dosage contained in such compositions is preferably within a range of circulating concentrations 
that includes the ED 50 with little or no toxicity. The dosage varies within this range depending 

35 upon the dosage form employed, the sensitivity of the patient, and the route of administration. 
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The exact dosage will be determined by the practitioner, in light of factors related to the 
subject requiring treatment. Dosage and administration are adjusted to provide sufficient levels of 
the active moiety or to maintain the desired effect. Factors which may be taken into account 
include the severity of the disease state, the general health of the subject, the age, weight, and 
5 gender of the subject, time and frequency of administration, drug combination(s), reaction 
sensitivities, and response to therapy. Long-acting pharmaceutical compositions may be 
administered every 3 to 4 days, every week, or biweekly depending on the half-life and clearance 
rate of the particular formulation. 

Normal dosage amounts may vary from about 0. 1 A*g to 1 00,000 ^g, up to a total dose of 

10 about 1 gram, depending upon the route of administration. Guidance as to particular dosages and 
methods of delivery is provided in the literature and generally available to practitioners in the art. 
Those skilled in the art will employ different formulations for nucleotides than for proteins or their 
inhibitors. Similarly, delivery of polynucleotides or polypeptides will be specific to particular 
cells, conditions, locations, etc. 

15 DIAGNOSTICS 

In another embodiment, antibodies which specifically bind HGPRP may be used for the 
diagnosis of disorders characterized by expression of HGPRP, or in assays to monitor patients 
being treated with HGPRP or agonists, antagonists, or inhibitors of HGPRP. Antibodies useful for 
diagnostic purposes may be prepared in the same manner as described above for therapeutics. 

20 Diagnostic assays for HGPRP include methods which utilize the antibody and a label to detect 
HGPRP in human body fluids or in extracts of cells or tissues. The antibodies may be used with 
or without modification, and may be labeled by covalent or non-covalent attachment of a reporter 
molecule. A wide variety of reporter molecules, several of which are described above, are known 
in the art and may be used. 

25 A variety of protocols for measuring HGPRP, including ELISAs, RIAs, and FACS, are 

known in the art and provide a basis for diagnosing altered or abnormal levels of HGPRP 
expression. Normal or standard values for HGPRP expression are established by combining body 
fluids or cell extracts taken from normal mammalian subjects, preferably human, with antibody to 
HGPRP under conditions suitable for complex formation. The amount of standard complex 

30 formation may be quantitated by various methods, preferably by photometric means. Quantities of 
HGPRP expressed in subject, control, and disease samples from biopsied tissues are compared 
with the standard values. Deviation between standard and subject values establishes the 
parameters for diagnosing disease. 

In another embodiment of the invention, the polynucleotides encoding HGPRP may be 

35 used for diagnostic purposes. The polynucleotides which may be used include oligonucleotide 
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sequences, complementary RNA and DNA molecules, and PNAs. The polynucleotides may be 
used to detect and quantitate gene expression in biopsied tissues in which expression of HGPRP 
may be correlated with disease. The diagnostic assay may be used to determine absence, 
presence, and excess expression of HGPRP, and to monitor regulation of HGPRP levels during 
5 therapeutic intervention. 

In one aspect, hybridization with PCR probes which are capable of detecting 
polynucleotide sequences, including genomic sequences, encoding HGPRP or closely related 
molecules may be used to identify nucleic acid sequences which encode HGPRP. The specificity 
of the probe, whether it is made from a highly specific region, e.g., the 5' regulatory region, or 

10 from a less specific region, e.g., a conserved motif, and the stringency of the hybridization or 
amplification (maximal, high, intermediate, or low), will determine whether the probe identifies 
only naturally occurring sequences encoding HGPRP. allelic variants, or related sequences. 

Probes may also be used for the detection of related sequences, and should preferably 
have at least 50% sequence identity to any of the HGPRP encoding sequences. The hybridization 

1 5 probes of the subject invention may be DNA or RNA and may be derived from the sequence of 
SEQ ID NO:7-12 or from genomic sequences including promoters, enhancers, and introns of the 
HGPRP gene. 

Means for producing specific hybridization probes for DNAs encoding HGPRP include 
the cloning of polynucleotide sequences encoding HGPRP or HGPRP derivatives into vectors for 

20 the production of mRNA probes. Such vectors are known in the art, are commercially available, 
and may be used to synthesize RNA probes in vitro by means of the addition of the appropriate 
RNA polymerases and the appropriate labeled nucleotides. Hybridization probes may be labeled 
by a variety of reporter groups, for example, by radionuclides such as n ? or 35 S, or by enzymatic 
labels, such as alkaline phosphatase coupled to the probe via avidin/biotin coupling systems, and 

25 the like. 

Polynucleotide sequences encoding HGPRP may be used for the diagnosis of disorders 
associated with expression of HGPRP. Examples of such disorders include, but are not limited to. 
a cell proliferative disorder such as actinic keratosis, arteriosclerosis, atherosclerosis, bursitis, 
cirrhosis, hepatitis, mixed connective tissue disease (MCTD), myelofibrosis, paroxysmal nocturnal 

30 hemoglobinuria, polycythemia vera, psoriasis, primary thrombocythemia; cancers including 
adenocarcinoma, leukemia, lymphoma, melanoma, myeloma, sarcoma, teratocarcinoma, and, in 
particular, cancers of the adrenal gland, bladder, bone, bone marrow, brain, breast, cervix, gall 
bladder, ganglia, gastrointestinal tract, heart, kidney, liver, lung, muscle, ovary, pancreas, 
parathyroid, penis, prostate, salivary glands, skin, spleen, testis, thymus, thyroid, and uterus; an 

35 immune disorder such as acquired immunodeficiency syndrome (AIDS), Addison's disease, adult 
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respiratory distress syndrome, allergies, ankylosing spondylitis, amyloidosis, anemia, asthma, 
atherosclerosis, autoimmune hemolytic anemia, autoimmune thyroiditis, bronchitis, cholecystitis, 
contact dermatitis, Crohn's disease, atopic dermatitis, dermatomyositis, diabetes meilitus, 
emphysema, episodic lymphopenia with lymphocytotoxins, erythroblastosis fetalis, erythema 
5 nodosum, atrophic gastritis, glomerulonephritis. Goodpasture's syndrome, gout, Graves ? disease, 
Hashimoto's thyroiditis, hypereosinophilia, irritable bowel syndrome, multiple sclerosis, 
myasthenia gravis, myocardial or pericardial inflammation, osteoarthritis, osteoporosis, 
pancreatitis, polymyositis, psoriasis, Reiter's syndrome, rheumatoid arthritis, scleroderma, 
Sjogren ; s syndrome, systemic anaphylaxis, systemic lupus erythematosus, systemic sclerosis, 

1 0 thrombocytopenic purpura, ulcerative colitis, uveitis, Werner syndrome, complications of cancer, 
hemodialysis, and extracorporeal circulation, viral, bacterial, fungal, parasitic, protozoal, and 
helminthic infections, and trauma; and a neurological disorder such as epilepsy, ischemic 
cerebrovascular disease, stroke, cerebral neoplasms, Alzheimer's disease, Pick's disease, 
Huntington's disease, dementia, Parkinson's disease and other extrapyramidal disorders, 

1 5 amyotrophic lateral sclerosis and other motor neuron disorders, progressive neural muscular 
atrophy, retinitis pigmentosa, hereditary ataxias, multiple sclerosis and other demyelinating 
diseases, bacterial and viral meningitis, brain abscess, subdural empyema, epidural abscess, 
suppurative intracranial thrombophlebitis, myelitis and radiculitis, viral central nervous system 
disease; prion diseases including kuru, Creutzfeldt- Jakob disease, and Gerstmann- 

20 Straussler-Scheinker syndrome; fatal familial insomnia, nutritional and metabolic diseases of the 
nervous system, neurofibromatosis, tuberous sclerosis, cerebelloretinal hemangioblastomatosis, 
encephalotrigeminal syndrome, mental retardation and other developmental disorders of the 
central nervous system, cerebral palsy, neuroskeletal disorders, autonomic nervous system 
disorders, cranial nerve disorders, spinal cord diseases, muscular dystrophy and other 

25 neuromuscular disorders, peripheral nervous system disorders, dermatomyositis and polymyositis; 
inherited, metabolic, endocrine, and toxic myopathies; myasthenia gravis, periodic paralysis: 
mental disorders including mood, anxiety, and schizophrenic disorders; akathesia, amnesia, 
catatonia, diabetic neuropathy, tardive dyskinesia, dystonias, paranoid psychoses, postherpetic 
neuralgia, and Tourette's disorder. The polynucleotide sequences encoding HGPRP may be used 

30 in Southern or northern analysis, dot blot, or other membrane-based technologies; in PCR 
technologies; in dipstick, pin, and multiformat ELISA-like assays; and in microarrays utilizing 
fluids or tissues from patients to detect altered HGPRP expression. Such qualitative or 
quantitative methods are well known in the art. 

In a particular aspect, the nucleotide sequences encoding HGPRP may be useful in assays 

35 that detect the presence of associated disorders, particularly those mentioned above. The 
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nucleotide sequences encoding HGPRP may be labeled by standard methods and added to a fluid 
or tissue sample from a patient under conditions suitable for the formation of hybridization 
complexes. After a suitable incubation period, the sample is washed and the signal is quantitated 
and compared with a standard value. If the amount of signal in the patient sample is significantly 
5 altered in comparison to a control sample then the presence of altered levels of nucleotide 

sequences encoding HGPRP in the sample indicates the presence of the associated disorder. Such 
assays may also be used to evaluate the efficacy of a particular therapeutic treatment regimen in 
animal studies, in clinical trials, or to monitor the treatment of an individual patient. 

In order to provide a basis for the diagnosis of a disorder associated with expression of 

10 HGPRP, a normal or standard profile for expression is established. This may be accomplished by 
combining body fluids or cell extracts taken from normal subjects, either animal or human, with a 
sequence, or a fragment thereof, encoding HGPRP, under conditions suitable for hybridization or 
amplification. Standard hybridization may be quantified by comparing the values obtained from 
normal subjects with values from an experiment in which a known amount of a substantially 

1 5 purified polynucleotide is used. Standard values obtained in this manner may be compared with 
values obtained from samples from patients who are symptomatic for a disorder. Deviation from 
standard values is used to establish the presence of a disorder. 

Once the presence of a disorder is established and a treatment protocol is initiated, 
hybridization assays may be repeated on a regular basis to determine if the level of expression in 

20 the patient begins to approximate that which is observed in the normal subject. The results 
obtained from successive assays may be used to show the efficacy of treatment over a period 
ranging from several days to months. 

With respect to cancer, the presence of an abnormal amount of transcript (either under- or 
overexpressed) in biopsied tissue from an individual may indicate a predisposition for the 

25 development of the disease, or may provide a means for detecting the disease prior to the 

appearance of actual clinical symptoms. A more definitive diagnosis of this type may allow health 
professionals to employ preventative measures or aggressive treatment earlier thereby preventing 
the development or further progression of the cancer. 

Additional diagnostic uses for oligonucleotides designed from the sequences encoding 

30 HGPRP may involve the use of PCR. These oligomers may be chemically synthesized, generated 
enzymatically, or produced in vitro . Oligomers will preferably contain a fragment of a 
polynucleotide encoding HGPRP. or a fragment of a polynucleotide complementary to the 
polynucleotide encoding HGPRP, and will be employed under optimized conditions for 
identification of a specific gene or condition. Oligomers may also be employed under less 

35 stringent conditions for detection or quantitation of closely related DNA or RNA sequences. 
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Methods which may also be used to quantitate the expression of HGPRP include 
radiolabeling or biotinylating nucleotides, coamplification of a control nucleic acid, and 
interpolating results from standard curves. (See, e.g., Melby, P.C. et al. (1993) J. Immunol. 
Methods 159:235-244; Duplaa, C. et al. (1993) Anal. Biochem. 212:229-236.) The speed of 
5 quantitation of multiple samples may be accelerated by running the assay in an ELISA format 
where the oligomer of interest is presented in various dilutions and a spectrophotometric or 
colorimetric response gives rapid quantitation. 

In further embodiments, oligonucleotides or longer fragments derived from any of the 
polynucleotide sequences described herein may be used as targets in a microarray. The 
10 microarray can be used to monitor the expression level of large numbers of genes simultaneously 
and to identify genetic variants, mutations, and polymorphisms. This information may be used to 
determine gene function, to understand the genetic basis of a disorder, to diagnose a disorder, and 
to develop and monitor the activities of therapeutic agents. 

Microarrays may be prepared, used, and analyzed using methods known in the art. (See, 
15 e.g., Brennan, T.M. et al. (1995) U.S. Patent No. 5,474,796; Schena, M. et al. (1996) Proc. Natl. 
Acad. Sci. USA 93:10614-10619; Baldeschweiier et al. (1995) PCT application W095/251 1 16; 
Shalon, D. et al. (1995) PCT application WO95/35505; Heller, R.A. et al. (1997) Proc. Natl. Acad. 
Sci. USA 94:2150-2155; and Heller, M.J. et al. (1997) U.S. Patent No. 5,605,662.) 

In another embodiment of the invention, nucleic acid sequences encoding HGPRP may be 
20 used to generate hybridization probes useful in mapping the naturally occurring genomic 

sequence. The sequences may be mapped to a particular chromosome, to a specific region of a 
chromosome, or to artificial chromosome constructions, e.g., human artificial chromosomes 
(HACs), yeast artificial chromosomes (YACs), bacterial artificial chromosomes (BACs), bacterial 
PI constructions, or single chromosome cDNA libraries. (See, e.g., Harrington, J.J. et al. (1997) 
25 Nat. Genet. 15:345-355; Price, CM. (1993) Blood Rev. 7:127-134; and Trask, B.J. (1991) Trends 
Genet. 7:149-154.) 

Fluorescent in situ hybridization (FISH) may be correlated with other physical 
chromosome mapping techniques and genetic map data. (See, e.g., Heinz-Ulrich, et al. (1995) in 
Meyers, supra, pp. 965-968.) Examples of genetic map data can be found in various scientific 

30 journals or at the Online Mendelian Inheritance in Man (OMIM) site. Correlation between the 
location of the gene encoding HGPRP on a physical chromosomal map and a specific disorder, or 
a predisposition to a specific disorder, may help define the region of DNA associated with that 
disorder. The nucleotide sequences of the invention may be used to detect differences in gene 
sequences among normal, carrier, and affected individuals. 

35 In situ hybridization of chromosomal preparations and physical mapping techniques, such 
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as linkage analysis using established chromosomal markers, may be used for extending genetic 
maps. Often the placement of a gene on the chromosome of another mammalian species, such as 
mouse, may reveal associated markers even if the number or arm of a particular human 
chromosome is not known. New sequences can be assigned to chromosomal arms by physical 
5 mapping. This provides valuable information to investigators searching for disease genes using 
positional cloning or other gene discovery techniques. Once the disease or syndrome has been 
crudelv localized by genetic linkage to a particular genomic region, e.g., ataxia-telangiectasia to 
1 lq22-23, any sequences mapping to that area may represent associated or regulatory genes for 
further investigation. (See, e.g., Gatti, R.A. et al. (1988) Nature 336:577-580.) The nucleotide 

1 0 sequence of the subject invention may also be used to detect differences in the chromosomal 
location due to translocation, inversion, etc., among normal, carrier, or affected individuals. 

In another embodiment of the invention, HGPRP, its catalytic or immunogenic fragments, 
or oligopeptides thereof can be used for screening libraries of compounds in any of a variety of 
drug screening techniques. The fragment employed in such screening may be free in solution. 

1 5 affixed to a solid support, borne on a cell surface, or located intracellularly. The formation of 
binding complexes between HGPRP and the agent being tested may be measured. 

Another technique for drug screening provides for high throughput screening of 
compounds having suitable binding affinity to the protein of interest. (See, e.g., Geysen, et al. 
(1984) PCT application WO84/03564.) In this method, large numbers of different small test 

20 compounds are synthesized on a solid substrate. The test compounds are reacted with HGPRP, or 
fragments thereof, and washed. Bound HGPRP is then detected by methods well known in the art. 
Purified HGPRP can also be coated directly onto plates for use in the aforementioned drug 
screening techniques. Alternatively, non-neutralizing antibodies can be used to capture the 
peptide and immobilize it on a solid support. 

25 In another embodiment, one may use competitive drug screening assays in which 

neutralizing antibodies capable of binding HGPRP specifically compete with a test compound for 
binding HGPRP. In this manner, antibodies can be used to detect the presence of any peptide 
which shares one or more antigenic determinants with HGPRP. 

In additional embodiments, the nucleotide sequences which encode HGPRP may be used 

30 in any molecular biology techniques that have yet to be developed, provided the new techniques 
rely on properties of nucleotide sequences that are currently known, including, but not limited to. 
such properties as the triplet genetic code and specific base pair interactions. 

Without further elaboration, it is believed that one skilled in the art can, using the 
35 preceding description, utilize the present invention to its fullest extent. The following preferred 
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specific embodiments are, therefore, to be construed as merely illustrative, and not limitative of 
the remainder of the disclosure in any way whatsoever. 

The disclosures of all patents, applications, and publications mentioned above and below, 
in particular U.S. Ser. No. 09/156,513, are hereby expressly incorporated by reference. 

5 

EXAMPLES 

I. Construction of cDNA Libraries 

RNA was purchased from Clontech or isolated from tissues described in Table 4. Some 
tissues were homogenized and lysed in guanidinium isothiocyanate, while others were 
1 0 homogenized and lysed in phenol or in a suitable mixture of denaturants, such as TRIZOL (Life 
Technologies), a monophasic solution of phenol and guanidine isothiocyanate. The resulting 
lysates were centrifuged over CsCl cushions or extracted with chloroform. RNA was precipitated 
from the lysates with either isopropanol or sodium acetate and ethanol, or by other routine 
methods. 

i 5 Phenol extraction and precipitation of RNA were repeated as necessary to increase RNA 

purity. In some cases, RNA was treated with DNase. For most libraries, poly(A+) RNA was 
isolated using oligo d(T)-coupled paramagnetic particles (Promega), OLIGOTEX latex particles 
(Qiagen, Valencia CA), or an OLIGOTEX mRNA purification kit (Qiagen). Alternatively, RNA 
was isolated directly from tissue lysates using other RNA isolation kits, e.g., the POLY(A)PURE 

20 mRNA purification kit (Ambion, Austin TX). 

In some cases, Stratagene was provided with RNA and constructed the corresponding 
cDNA libraries. Otherwise, cDNA was synthesized and cDNA libraries were constructed with the 
UNIZAP vector system (Stratagene) or SUPERSCRIPT plasm id system (Life Technologies), 
using the recommended procedures or similar methods known in the art. (See, e.g., Ausubel, 

25 1997, supra , units 5. 1-6.6.) Reverse transcription was initiated using oligo d(T) or random 

primers. Synthetic oligonucleotide adapters were ligated to double stranded cDNA, and the cDNA 
was digested with the appropriate restriction enzyme or enzymes. For most libraries, the cDNA 
was size-selected (300-1000 bp) using SEPHACRYL SI 000, SEPHAROSE CL2B ? or 
SEPHAROSE CL4B column chromatography (Amersham Pharmacia Biotech) or preparative 

30 agarose gel electrophoresis. cDNAs were ligated into compatible restriction enzyme sites of the 
polylinker of a suitable plasmid, e.g., PBLUESCRIPT plasmid (Stratagene), PSPORT1 plasmid 
(Life Technologies), or pINCY (Incyte Pharmaceuticals, Palo Alto CA). Recombinant plasmids 
were transformed into competent E^coh ceils including XLl-Blue, XLl-BIueMRF, or SOLR from 
Stratagene, or DH5a, DH10B, or ElectroMAX DH10B from Life Technologies. 
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II. Isolation of cDNA Clones 

Plasmids were recovered from host cells by in vivo excision, using the UNIZAP vector 
system (Stratagene) or cell lysis. Plasmids were purified using at least one of the following: a 
MAGIC or WIZARD MINIPREPS DNA purification system (Promega); an AGTC MINIPREP 
5 purification kit (Edge Biosystems, Gaithersburg MD); and QIAWELL 8 Plasmid, QIAWELL 8 
Plus Plasmid. QIAWELL 8 Ultra Plasmid purification systems or the REAL Prep 96 plasmid kit 
from Qiagen. Following precipitation, plasmids were resuspended in 0.1 ml of distilled water and 
stored, with or without lyophilization, at 4°C. 

Alternatively, plasmid DNA was amplified from host cell lysates using direct link PCR in 
10 a high-throughput format (Rao, V.B. (1994) Anal. Biochem. 216:1-14). Host cell lysis and 
thermal cycling steps were carried out in a single reaction mixture. Samples were processed and 
stored in 384-well plates, and the concentration of amplified plasmid DNA was quantified 
fluorometrically using PICOGREEN dye (Molecular Probes, Eugene OR) and a Fiuoroskan II 
fluorescence scanner (Labsystems Oy, Helsinki, Finland). 
15 III. Sequencing and Analysis 

The cDNAs were prepared for sequencing using the ABI CATALYST 800 (PE 
Biosystems) or the HYDRA microdispenser (Robbins Scientific) or MICROLAB 2200 (Hamilton) 
systems in combination with the DNA ENGINE thermal cyclers (MJ Research). The cDNAs were 
sequenced using the ABI PRISM 373 or 377 sequencing systems (PE Biosystems) and standard 
20 ABI protocols, base calling software, and kits. In one alternative, cDNAs were sequenced using 
the MEGABACE 1000 DNA sequencing system (Amersham Pharmacia Biotech). In another 
alternative, the cDNAs were amplified and sequenced using the ABI PRISM BIGDYE Terminator 
cycle sequencing ready reaction kit (PE Biosystems). In yet another alternative, cDNAs were 
sequenced using solutions and dyes from Amersham Pharmacia Biotech. Reading frames for the 
25 ESTs were determined using standard methods (reviewed in Ausubel, 1997. supra , unit 7.7). 
Some of the cDNA sequences were selected for extension using the techniques disclosed in 
Example V. 

The polynucleotide sequences derived from cDNA, extension, and shotgun sequencing 
were assembled and analyzed using a combination of software programs which utilize algorithms 

30 well known to those skilled in the art. Table 5 summarizes the software programs, descriptions, 
references, and threshold parameters used. The first column of Table 5 shows the tools, programs, 
and algorithms used, the second column provides a brief description thereof, the third column 
presents the references which are incorporated by reference herein, and the fourth column 
presents, where applicable, the scores, probability values, and other parameters used to evaluate 

35 the strength of a match between two sequences (the higher the probability the greater the 
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homology). Sequences were analyzed using MACDNASIS PRO software (Hitachi Software 
Engineering, S. San Francisco CA) and LASERGENE software (DNASTAR). 

The polynucleotide sequences were validated by removing vector, linker, and polyA 
sequences and by masking ambiguous bases, using algorithms and programs based on BLAST, 
5 dynamic programing, and dinucleotide nearest neighbor analysis. The sequences were then 
queried against a selection of public databases such as GENBANK primate, rodent, mammalian, 
vertebrate, and eukaryote databases, and BLOCKS to acquire annotation, using programs based 
on BLAST, FASTA, and BLIMPS. The sequences were assembled into full length polynucleotide 
sequences using programs based on Phred, Phrap, and Consed, and were screened for open 

10 reading frames using programs based on GeneMark, BLAST, and FASTA. The full length 
polynucleotide sequences were translated to derive the corresponding full length amino acid 
sequences, and these full length sequences were subsequently analyzed by querying against 
databases such as the GENBANK databases (described above), SWISSPROT, BLOCKS, 
PRINTS, PFAM, and PROSITE. 

15 The programs described above for the assembly and analysis of full length polynucleotide 

and amino acid sequences were also used to identify polynucleotide sequence fragments from 
SEQ ID NO:7-12. Fragments from about 20 to about 4000 nucleotides which are useful in 
hybridization and amplification technologies were described in The Invention section above. 
IV. Northern Analysis 

20 Northern analysis is a laboratory technique used to detect the presence of a transcript of a 

gene and involves the hybridization of a labeled nucleotide sequence to a membrane on which 
RNAs from a particular cell type or tissue have been bound. (See, e.g., Sambrook, su£ra, ch. 7; 
Ausubel, 1995, supra, ch. 4 and 16.) 

Analogous computer techniques applying BLAST were used to search for identical or 

25 related molecules in nucleotide databases such as GENBANK or LIFESEQ database (Incyte 
Pharmaceuticals). This analysis is much faster than multiple membrane-based hybridizations. In 
addition, the sensitivity of the computer search can be modified to determine whether any 
particular match is categorized as exact or similar. The basis of the search is the product score, 
which is defined as: 

30 % sequence identity x % maximum BLAST score 

100 

The product score takes into account both the degree of similarity between two sequences and the 
length of the sequence match. For example, with a product score of 40, the match will be exact 
within a 1% to 2% error, and, with a product score of 70, the match will be exact. Similar 
35 molecules are usually identified by selecting those which show product scores between 1 5 and 40, 
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although lower scores may identify related molecules. 

The results of northern analyses are reported a percentage distribution of libraries in which 
the transcript encoding HGPRP occurred. Analysis involved the categorization of cDNA libraries 
by organ/tissue and disease. The organ/tissue categories included cardiovascular, dermatologic, 
5 developmental, endocrine, gastrointestinal, hematopoietic/immune, musculoskeletal, nervous, 
reproductive, and urologic. The disease categories included cancer, inflammation/trauma, fetal, 
neurological, and pooled. For each category, the number of libraries expressing the sequence of 
interest was counted and divided by the total number of libraries across all categories. Percentage 
values of tissue-specific and disease expression are reported in Table 3. 

1 0 V. Extension of HGPRP Encoding Polynucleotides 

The full length nucleic acid sequence of SEQ ID NO:7-12 was produced by extension of 
an appropriate fragment of the full length molecule using oligonucleotide primers designed from 
this fragment. One primer was synthesized to initiate 5* extension of the known fragment, and the 
other primer, to initiate 3' extension of the known fragment. The initial primers were designed 

1 5 using OLIGO 4.06 software (National Biosciences), or another appropriate program, to be about 
22 to 30 nucleotides in length, to have a GC content of about 50% or more, and to anneal to the 
target sequence at temperatures of about 68°C to about 72°C. Any stretch of nucleotides which 
would result in hairpin structures and primer-primer dimerizations was avoided. 

Selected human cDNA libraries were used to extend the sequence. If more than one 

20 extension was necessary or desired, additional or nested sets of primers were designed. 

High fidelity amplification was obtained by PCR using methods well known in the art. 
PCR was performed in 96-well plates using the DNA ENGINE thermal cycler (MJ Research). 
The reaction mix contained DNA template, 200 nmol of each primer, reaction buffer containing 
Mg 2+ , (NH 4 ) 2 S0 4 , and p-mercaptoethanol, Taq DNA polymerase (Amersham Pharmacia Biotech), 

25 ELONGASE enzyme (Life Technologies), and Pfu DNA polymerase (Stratagene), with the 

following parameters for primer pair PCI A and PCI B: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; 
Step 3: 60°C, 1 min; Step 4: 68°C, 2 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 
68°C 5 min; Step 7: storage at 4°C. In the alternative, the parameters for primer pair T7 and SK+ 
were as follows: Step 1 : 94°C, 3 min; Step 2: 94 °C, 15 sec; Step 3: 57°C, 1 min; Step 4: 68°C, 2 

30 min; Step 5: Steps 2, 3, and 4 repeated 20 times; Step 6: 68°C, 5 min; Step 7: storage at 4°C. 
The concentration of DNA in each well was determined by dispensing 100 (il 
PICOGREEN quantitation reagent (0.25% (v/v) PICOGREEN; Molecular Probes, Eugene OR) 
dissolved in IX TE and 0.5 \i\ of undiluted PCR product into each well of an opaque fluorimeter 
plate (Corning Costar, Acton MA), allowing the DNA to bind to the reagent. The plate was 

35 scanned in a Fluoroskan II (Labsystems Oy, Helsinki. Finland) to measure the fluorescence of the 
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sample and to quantify the concentration of DNA. A 5 fj\ to 10 ^1 aliquot of the reaction mixture 
was analyzed by electrophoresis on a 1 % agarose mini-gel to determine which reactions were 
successful in extending the sequence. 

The extended nucleotides were desalted and concentrated, transferred to 3 84- well plates, 
5 digested with CviJI cholera virus endonuclease (Molecular Biology Research, Madison WI), and 
sonicated or sheared prior to religation into pUC 1 8 vector (Amersham Pharmacia Biotech). For 
shotgun sequencing, the digested nucleotides were separated on low concentration (0.6 to 0.8%) 
agarose gels, fragments were excised, and agar digested with Agar ACE (Promega). Extended 
clones were reiigated using T4 ligase (New England Biolabs, Beverly MA) into pUC 18 vector 

10 (Amersham Pharmacia Biotech), treated with Pfu DNA polymerase (Stratagene) to fili-in 

restriction site overhangs, and transfected into competent _E,_colj cells. Transformed cells were 
selected on antibiotic-containing media, individual colonies were picked and cultured overnight at 
37°C in 384-well plates in LB/2x carb liquid media. 

The cells were lysed, and DNA was amplified by PCR using Taq DNA polymerase 

1 5 (Amersham Pharmacia Biotech) and Pfu DNA polymerase (Stratagene) with the following 

parameters: Step 1: 94°C, 3 min; Step 2: 94°C, 15 sec; Step 3: 60°C, 1 min; Step 4: 72°C, 2 min; 
Step 5: steps 2, 3, and 4 repeated 29 times; Step 6: 72°C, 5 min; Step 7: storage at 4°C. DNA was 
quantified by PICOGREEN reagent (Molecular Probes) as described above. Samples with low 
DNA recoveries were reamplified using the same conditions as described above. Samples were 

20 diluted with 20% dimethysulphoxide (1:2, v/v), and sequenced using DYENAMIC energy transfer 
sequencing primers and the DYENAMIC DIRECT kit (Amersham Pharmacia Biotech) or the ABI 
PRISM BIGDYE Terminator cycle sequencing ready reaction kit (PE Biosystems). 

In like manner, the nucleotide sequence of SEQ ID NO:7-12 is used to obtain 5' 
regulatory sequences using the procedure above, oligonucleotides designed for such extension, 

25 and an appropriate genomic library. 

VI. Labeling and Use of Individual Hybridization Probes 

Hybridization probes derived from SEQ ID NO:7-12 are employed to screen cDNAs, 
genomic DNAs, or mRNAs. Although the labeling of oligonucleotides, consisting of about 20 
base pairs, is specifically described, essentially the same procedure is used with larger nucleotide 

30 fragments. Oligonucleotides are designed using state-of-the-art software such as OLIGO 4.06 
software (National Biosciences) and labeled by combining 50 pmol of each oligomer, 250 fuCi of 
[y- 32 P] adenosine triphosphate (Amersham Pharmacia Biotech), and T4 polynucleotide kinase 
(DuPont NEN, Boston MA). The labeled oligonucleotides are substantially purified using a 
SEPHADEX G-25 superfine size exclusion dextran bead column (Amersham Pharmacia Biotech). 

35 An aliquot containing 10 7 counts per minute of the labeled probe is used in a typical membrane- 
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based hybridization analysis of human genomic DNA digested with one of the following 
endonucleases: Ase I, Bgl II, Eco RI, Pst I, Xba I, or Pvu II (DuPont NEN). 

The DNA from each digest is fractionated on a 0.7% agarose gel and transferred to nylon 
membranes (NYTRAN PLUS, Schleicher & Schuell, Durham NH). Hybridization is carried out 
5 for 16 hours at 40°C. To remove nonspecific signals, blots are sequentially washed at room 
temperature under increasingly stringent conditions up to 0.1 x saline sodium citrate and 0.5% 
sodium dodecyl sulfate. After XOMAT-AR film (Eastman Kodak, Rochester NY) is exposed to 
the blots for several hours, hybridization patterns are compared. 

VII. Microarrays 

1 o A chemical coupling procedure and ah ink jet device can be used to synthesize array 

elements on the surface of a substrate. (See, e.g., Baldeschweiler, suora.) An array analogous to a 
dot or slot blot may also be used to arrange and link elements to the surface of a substrate using 
thermal, UV, chemical, or mechanical bonding procedures. A typical array may be produced by 
hand or using available methods and machines and contain any appropriate number of elements. 

1 5 After hybridization, nonhybridized probes are removed and a scanner used to determine the levels 
and patterns of fluorescence. The degree of complementarity and the relative abundance of each 
probe which hybridizes to an element on the microarray may be assessed through analysis of the 
scanned images. 

Full-length cDNAs, Expressed Sequence Tags (ESTs), or fragments thereof may 
20 comprise the elements of the microarray. Fragments suitable for hybridization can be selected 
using software well known in the art such as LASERGENE software (DNASTAR). Full-length 
cDNAs, ESTs, or fragments thereof corresponding to one of the nucleotide sequences of the 
present invention, or selected at random from a cDNA library relevant to the present invention, are 
arranged on an appropriate substrate, e.g., a glass slide. The cDNA is fixed to the slide using, e.g., 
25 UV cross-linking followed by thermal and chemical treatments and subsequent drying. (See, e.g., 
Schena, M. et al. (1995) Science 270:467-470; Shalon, D. et al. (1996) Genome Res. 6:639-645.) 
Fluorescent probes are prepared and used for hybridization to the elements on the substrate. The 
substrate is analyzed by procedures described above. 

VIII. Complementary Polynucleotides 

30 Sequences complementary to the HGPRP-encoding sequences, or any parts thereof, are 

used to detect, decrease, or inhibit expression of naturally occurring HGPRP. Although use of 
oligonucleotides comprising from about 15 to 30 base pairs is described, essentially the same 
procedure is used with smaller or with larger sequence fragments. Appropriate oligonucleotides 
are designed using OLIGO 4.06 software (National Biosciences) and the coding sequence of 

35 HGPRP. To inhibit transcription, a complementary oligonucleotide is designed from the most 
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unique 5' sequence and used to prevent promoter binding to the coding sequence. To inhibit 
translation, a complementary oligonucleotide is designed to prevent ribosomal binding to the 
HGPRP-encoding transcript. 

IX. Expression of HGPRP 

5 Expression and purification of HGPRP is achieved using bacterial or virus-based 

expression systems. For expression of HGPRP in bacteria, cDNA is subcioned into an appropriate 
vector containing an antibiotic resistance gene and an inducible promoter that directs high levels 
of cDNA transcription. Examples of such promoters include, but are not limited to, the trp-lac 
(tac) hybrid promoter and the T5 or T7 bacteriophage promoter in conjunction with the lac 

1 0 operator regulatory element. Recombinant vectors are transformed into suitable bacterial hosts, 
e.g., BL21(DE3). Antibiotic resistant bacteria express HGPRP upon induction with isopropyl 
beta-D-thiogalactopyranoside (IPTG). Expression of HGPRP in eukaryotic cells is achieved by 
infecting insect or mammalian cell lines with recombinant Autographica californica nuclear 
polyhedrosis virus (AcMNPV), commonly known as baculovirus. The nonessential polyhedrin 

1 5 gene of baculovirus is replaced with cDNA encoding HGPRP by either homologous 

recombination or bacterial-mediated transposition involving transfer plasmid intermediates. Viral 
infectivity is maintained and the strong polyhedrin promoter drives high levels of cDNA 
transcription. Recombinant baculovirus is used to infect Spodoptera frusiperda (Sf9) insect cells 
in most cases, or human hepatocytes, in some cases. Infection of the latter requires additional 

20 genetic modifications to baculovirus. (See Engelhard, E.K. et al. (1994) Proc. Natl. Acad. Sci. 
USA 91:3224-3227; Sandig, V. et al. (1996) Hum. Gene Then 7:1937-1945.) 

In most expression systems, HGPRP is synthesized as a fusion protein with, e.g., 
glutathione S-transferase (GST) or a peptide epitope tag, such as FLAG or 6-His ? permitting rapid, 
single-step, affinity-based purification of recombinant fusion protein from crude cell lysates. 

25 GST, a 26-kilodalton enzyme from Schistosoma iaoonicum , enables the purification of fusion 
proteins on immobilized glutathione under conditions that maintain protein activity and 
antigenicity (Amersham Pharmacia Biotech). Following purification, the GST moiety can be 
proteolytically cleaved from HGPRP at specifically engineered sites. FLAG, an 8-amino acid 
peptide, enables immunoaffinity purification using commercially available monoclonal and 

30 polyclonal anti-FLAG antibodies (Eastman Kodak). 6-His, a stretch of six consecutive histidine 
residues, enables purification on metal-chelate resins (QIAGEN). Methods for protein expression 
and purification are discussed in Ausubel (1995, surjra, ch 10 and 16). Purified HGPRP obtained 
by these methods can be used directly in the following activity assay. 

X, Demonstration of HGPRP Activity 

35 GPCR activity of HGPRP is determined in a ligand-binding assay using candidate ligand 
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molecules in the presence of ,25 I-labeled HGPRP. HGPRP is labeled with ,25 I Bolton-Hunter 
reagent. (See, e.g., Bolton, A.E. and W.M. Hunter (1973) Biochem. J. 133:529-539.) Candidate 
ligand molecules previously arrayed in the wells of a multi-well plate are incubated with the 
labeled HGPRP, washed, and any wells with labeled HGPRP complex are assayed. Data obtained 
5 using different concentrations of HGPRP are used to calculate values for the number, affinity, and 
association of HGPRP with the ligand molecules. 
XI. Functional Assays 

HGPRP function is assessed by expressing the sequences encoding HGPRP at 
physiologically elevated levels in mammalian cell culture systems. cDNA is subcloned into a 

1 0 mammalian expression vector containing a strong promoter that drives high levels of cDNA 
expression. Vectors of choice include pCMV SPORT plasmid (Life Technologies) and pCR3.1 
plasmid (Invitrogen, Carlsbad CA), both of which contain the cytomegalovirus promoter. 5-10 fxg 
of recombinant vector are transiently transfected into a human cell line, preferably of endothelial 
or hematopoietic origin, using either liposome formulations or electroporation. 1-2 (xg of an 

1 5 additional plasmid containing sequences encoding a marker protein are co-transfected. Expression 
of a marker protein provides a means to distinguish transfected cells from nontransfected cells and 
is a reliable predictor of cDNA expression from the recombinant vector. Marker proteins of 
choice include, e.g., Green Fluorescent Protein (GFP; Ciontech), CD64, or a CD64-GFP fusion 
protein. Flow cytometry (FCM), an automated, laser optics-based technique, is used to identify 

20 transfected cells expressing GFP or CD64-GFP, and to evaluate properties, for example, their 
apoptotic state. FCM detects and quantifies the uptakeof fluorescent molecules that diagnose 
events preceding or coincident with cell death. These events include changes in nuclear DNA 
content as measured by staining of DNA with propidium iodide; changes in cell size and 
granularity as measured by forward light scatter and 90 degree side light scatter; down-regulation 

25 of DNA synthesis as measured by decrease in bromodeoxyuridine uptake: alterations in 
expression of cell surface and intracellular proteins as measured by reactivity with specific 
antibodies; and alterations in plasma membrane composition as measured by the binding of 
fluorescein-conjugated Annexin V protein to the cell surface. Methods in flow cytometry are 
discussed in Ormerod, M G (1994^ Flow Cytometry . Oxford, New York NY. 

30 The influence of HGPRP on gene expression can be assessed using highly purified 

populations of cells transfected with sequences encoding HGPRP and either CD64 or CD64-GFP. 
CD64 and CD64-GFP are expressed on the surface of transfected cells and bind to conserved 
regions of human immunoglobulin G (IgG). Transfected cells are efficiently separated from 
nontransfected cells using magnetic beads coated with either human IgG or antibody against CD64 

35 (DYNAL, Lake Success NY). mRNA can be purified from the cells using methods well known 
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by those of skill in the art. Expression of mRNA encoding HGPRP and other genes of interest can 
be analyzed by northern analysis or microarray techniques. 
XII. Production of HGPRP Specific Antibodies 

HGPRP substantially purified using polyacrylamide gel electrophoresis (PAGE; see, e.g., 
5 Harrington. M.G. ( 1 990) Methods Enzymol. 1 82:488-495), or other purification techniques, is 
used to immunize rabbits and to produce antibodies using standard protocols. 

Alternatively, the HGPRP amino acid sequence is analyzed using LASERGENE software 
(DNASTAR) to determine regions of high immunogenicity, and a corresponding oligopeptide is 
synthesized and used to raise antibodies by means known to those of skill in the art. Methods for 
10 selection of appropriate epitopes, such as those hear the C-terrninus or in hydrophilic regions are 
well described in the art. (See, e.g., Ausubel, 1995, supra , ch. 11.) 

Typically, oligopeptides 15 residues in length are synthesized using an ABI 431 A peptide 
synthesizer (PE Biosystems) using fmoc-chemistry and coupled to KLH (Sigma-Aldrich, St. 
Louis MO) by reaction with N-maleimidobenzoyl-N-hydroxysuccinimide ester (MBS) to increase 

15 immunogenicity. (See, e.g.. Ausubel. 1995. supra .) Rabbits are immunized with the oligopeptide- 
KLH complex in complete Freund's adjuvant. Resulting antisera are tested for antipeptide activity 
by, for example, binding the peptide to plastic, blocking with 1% BSA, reacting with rabbit 
antisera, washing, and reacting with radio-iodinated goat anti-rabbit IgG. 
XIIL Purification of Naturally Occurring HGPRP Using Specific Antibodies 

20 Naturally occurring or recombinant HGPRP is substantially purified by immunoaffinity 

chromatography using antibodies specific for HGPRP. An immunoaffinity column is constructed 
by covalently coupling anti-HGPRP antibody to an activated chromatographic resin, such as 
CNBr-activated SEPHAROSE (Amersham Pharmacia Biotech). After the coupling, the resin is 
blocked and washed according to the manufacturer's instructions. 

25 Media containing HGPRP are passed over the immunoaffinity column, and the column is 

washed under conditions that allow the preferential absorbance of HGPRP (e.g., high ionic 
strength buffers in the presence of detergent). The column is eluted under conditions that disrupt 
antibody/HGPRP binding (e.g., a buffer of pH 2 to pH 3, or a high concentration of a chaotrope, 
such as urea or thiocyanate ion), and HGPRP is collected. 

30 XIV. Identification of Molecules Which Interact with HGPRP 

HGPRP, or biologically active fragments thereof, are labeled with 125 I Bolton-Hunter 
reagent. (See, e.g., Bolton and Hunter, supra .) Candidate molecules previously arrayed in the 
wells of a multi-well plate are incubated with the labeled HGPRP, washed, and any wells with 
labeled HGPRP complex are assayed. Data obtained using different concentrations of HGPRP are 

35 used to calculate values for the number, affinity, and association of HGPRP with the candidate 
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molecules. 

Various modifications and variations of the described methods and systems of the 
invention will be apparent to those skilled in the art without departing from the scope and spirit of 
the invention. Although the invention has been described in connection with specific preferred 
embodiments, it should be understood that the invention as claimed should not be unduly limited 
to such specific embodiments. Indeed, various modifications of the described modes for carrying 
out the invention which are obvious to those skilled in molecular biology or related fields are 
intended to be within the scope of the following claims. 
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What is claimed is: 

, A substantially purified polypeptide comprising an amino acid sequence selected 
from the group consisting of SEQ ID NO: 1, SEQ ID NO:2, SEQ ID NO:3, SEQ ID NO:4, and 
SEQ ID NO:6, and fragments thereof. 
5 2 . A substantially purified variant having at least 90% amino acid sequence identity 

to the amino acid sequence of claim 1 . 

3. An isolated and purified polynucleotide encoding the polypeptide of claim 1 . 

4. An isolated and purified polynucleotide variant having at least 70% 
polynucleotide sequence identity to the polynucleotide of claim 3. 

, 0 5. An isolated and purified polynucleotide which hybridizes under stringent 

conditions to the polynucleotide of claim 3. 

6. An isolated and purified polynucleotide having a sequence which is 

complementary to the polynucleotide of claim 3. 

7. A method for detecting a polynucleotide, the method comprising the steps of: 

, 5 (a) hybridizing the polynucleotide of claim 6 to at least one nucleic acid in a 

sample, thereby forming a hybridization complex; and 

(b) detecting the hybridization complex, wherein the presence of the 

hybridization complex correlates with the presence of the polynucleotide in the sample. 

g. The method of claim 7 further comprising amplifying the polynucleotide prior to 
20 hybridization. 

9 An isolated and purified polynucleotide comprising a polynucleotide sequence 
selected from the group consisting of SEQ ID NO:7, SEQ ID NO:8, SEQ ID NO:9, SEQ ID 
NO: 1 0, and SEQ ID NO: 12, and fragments thereof. 

1 0. An isolated and purified polynucleotide variant having at least 70% 
25 polynucleotide sequence identity to the polynucleotide of claim 9. 

11. An isolated and purified polynucleotide having a sequence which is 
complementary to the polynucleotide of claim 9. 

12. An expression vector comprising at least a fragment of the polynucleotide of 

claim 3. 

30 13. A host cell comprising the expression vector of claim 12. 

14. A method for producing a polypeptide, the method comprising the steps of: 

a) culturing the host cell of claim 13 under conditions suitable for the 

expression of the polypeptide; and 

b ) recovering the polypeptide from the host cell culture. 
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1 5. A pharmaceutical composition comprising the polypeptide of claim 1 in 
conjunction with a suitable pharmaceutical carrier. 

1 6. A purified antibody which specifically binds to the polypeptide of claim 1 . 

1 7. A purified agonist of the polypeptide of claim 1 . 

5 1 8. A purified antagonist of the polypeptide of claim 1 . 

19. A method for treating or preventing a disorder associated with decreased 
expression or activity of HGPRP, the method comprising administering to a subject in need of 
such treatment an effective amount of the pharmaceutical composition of claim 15. 

20. A method for treating or preventing a disorder associated with increased 

1 0 expression or activity of HGPRP, the method comprising admin istering to a subject in need of 
such treatment an effective amount of the antagonist of claim 18. 



56 



WO 00/15793 



PCT/US99/20958 



SEQUENCE LISTING 



<110> INCYTE PHARMACEUTICALS, INC. 
BANDMAN, Olga 
LAL, Preeti 
TANG, Y. Tom 
CORLEY, Neil C. 
GUEGLER , Karl J. 
GORGONE, Gina A. 
BAUGHN, Mariah R. 

<120> HUMAN GPCR PROTEINS 

<130> PF-0597 PCT 

<14 0> To Be Assigned 
<141> Herewith 

<150> 09/156,513 
<151> 1998-09-17 



<160> 12 



<170> PERL Program 

<210> 1 
<211> 441 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc__f eature 

<223> incyte ID No: 1258981CD1 



<400> 1 


















Met 


Ala 


He 


His 


Lys 


Ala 


Leu 


Val 


Met 


Cys 


1 








5 










10 


Phe 


Leu 


Phe 


Pro 


Gly 


Ala 


Trp 


Ala 


Gin Gly 








20 










25 


Cys 


Ser 


Gin 


Gly 


Leu 
35 


Asn 


Pro 


Leu 


Tyr 


Tyr 
40 


Ser 


Gly 


Ala 


Trp 


Gly 


He 


Val 


Leu 


Glu 


Ala 








50 










55 


He 


Val 


Thr 


Thr 


Phe 


Val 


Leu 


Thr 


He 


He 








65 










70 


Pro 


Phe 


Val 


Gin 


Asp 


Thr 


Lys 


Lys 


Arg 


Ser 








80 










85 


Val 


Phe 


Phe 


Leu 


Leu Gly Thr Leu Gly 


Leu 








95 










100 


Ala 


Cys 


Val 


Val 


Lys 
110 


Pro 


Asp 


Phe 


Ser 


Thr 
115 


Phe 


Leu 


Phe 


Gly 


Val 


Leu 


Phe 


Ala 


He 


Cys 






125 










130 


Ala 


His 


Val 


Phe 


Ala 


Leu 


Asn 


Phe 


Leu 


Ala 






140 










145 


Pro 


Arg 


Gly Trp 


Val 


He 


Phe 


Thr 


Val 


Ala 



15 
Gly 

30 
Arg 

45 
Gly 

60 
Leu 

75 
Gin 

90 
Phe 
105 
Arg 
120 
Ala 
135 
Gly 
150 
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155 160 165 

val Glu Val lie lie Asn Thr Glu Trp Leu lie lie Thr Leu Val 

170 175 180 

Arg Gly ser Gly Glu Gly Gly Pro Gin Gly Asn Ser Ser Ala Gly 

185 190 195 

Trp Ala Val Ala Ser Pro Cys Ala lie Ala Asn Met Asp Phe Val 

200 205 210 

Met Ala Leu lie Tyr Val Met Leu Leu Leu Leu Gly Ala Phe Leu 

215 220 225 

Gly Ala Trp Pro Ala Leu Cys Gly Arg Tyr Lys Arg Trp Arg Lys 

230 • 235 240 

His Gly Val Phe Val Leu Leu Thr Thr Ala Thr Ser Val Ala He 

245 250 255 

Tro Val val Trp He Val Met Tyr Thr Tyr Gly Asn Lys Gin His 

260 265 270 

Asn Ser Pro Thr Trp Asp Asp Pro Thr Leu Ala He Ala Leu Ala 

275 280 285 

Ala Asn Ala Trp Ala Phe Val Leu Phe Tyr Val He Pro Glu Val 

290 295 300 

Ser Gin Val Thr Lys Ser Ser Pro Glu Gin Ser Tyr Gin Gly Asp 

305 31° 315 

Met Tyr Pro Thr Arg Gly Val Gly Tyr Glu Thr He Leu Lys Glu 

320 325 330 

Gin Lys Gly Gin Ser Met Phe Val Glu Asn Lys Ala Phe Ser Met 

335 340 345 

Asp Glu Pro val Ala Ala Lys Arg Pro Val Ser Pro Tyr Ser Gly 
P 350 355 360 

Tyr Asn Gly Gin Leu Leu Thr Ser Val Tyr Gin Pro Thr Glu Met 

365 370 375 

Ala Leu Met His Lys Val Pro Ser Glu Gly Ala Tyr Asp He He 

380 385 390 

Leu Pro Arg Ala Thr Ala Asn Ser Gin Val Met Gly Ser Ala Asn 

395 400 405 

Ser Thr Leu Arg Ala Glu Asp Met Tyr Ser Ala Gin Ser His Gin 

410 ' 415 420 

Ala Ala Thr Pro Pro Lys Asp Gly Lys Asn Ser Gin Val Phe Arg 

425 430 435 

Asn Pro Tyr Val Trp Asp 

440 

<210> 2 
<211> 353 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 1459432CD1 



Met Asp Leu Glu Ala Ser Leu Leu Pro Thr Gly Pro Asn Ala Ser 

15 10 15 

Asn Thr Ser Asp Gly Pro Asp Asn Leu Thr Ser Ala Gly Ser Pro 

20 25 30 

Pro Arg Thr Gly Ser lie Ser Tyr He Asn He He Met Pro Ser 

35 40 45 

2/13 



WO 00/15793 



PCT/US99/20958 



Val Phe 
Val He 
Asn Val 
Leu Phe 
Asn Gly 
Ala Met 
Ala Met 
Ser Thr 
Leu Leu 
Tyr Ala 
He Arg 
Tyr Gin 
Ala Ala 
Pro Thr 
Arg Thr 
Pro Tyr 
Thr Pro 
Tyr Ala 
Glu Thr 
Gin Gly 
Glu Arg 



Gly Thr 
Phe Ala 
Pro Asp 
Leu Leu 
Val Trp 
Asp Ala 
Ala He 
Lys Phe 
Trp Ala 
Arg Leu 
Leu Pro 
Phe Phe 
Tyr Val 
Ser Gin 
Ala He 
Tyr Val 
Thr Phe 
Asn Ser 
Phe Arg 
Gin Leu 
Thr Glu 



He 

50 
Val 

65 
He 

80 
Gly 

95 
His 
110 
Asn 
125 
Asp 
140 
Arg 
155 
Leu 
170 
He 
185 
Asn 
200 
Leu 
215 
Arg 
230 
Arg 
245 
Ala 
260 
Leu 
275 
Val 
290 
Cys 
305 
Lys 
320 
Arg 
335 
Ser 
350 



Cys 
Val 
Phe 
Met 
Phe 
Ser 
Arg 
Lys 
Ser 
Pro 
Pro 
Ala 
He 
Ser 
He 
Gin 
Tyr 
Leu 
Arg 
Ala 
Lys 



Leu Leu 
Lys Lys 
He He 
Pro Phe 
Gly Glu 
Gin Phe 
Tyr Leu 
Pro Ser 
Phe He 
Phe Pro 
Asp Thr 
Phe Ala 
Leu Gin 
lie Arg 
Cys Leu 
Leu Thr 
Leu Tyr 
Asn Pro 
Leu Val 
Val Ser 
Gly Thr 



Gly 
Ser 
Asn 
Met 
Thr 
Thr 
Ala 
Val 
Ser 
Gly 
Asp 
Leu 
Arg 
Leu 
Val 
Gin 
Asn 
Phe 
Leu 
Asn 



He 
55 
Lys 
70 
Leu 
85 
He 
100 
Met 
115 
Ser 
130 
Thr 
145 
Ala 
160 
He 
175 
Gly 
190 
Leu 
205 
Pro 
220 
Met 
235 
Arg 
250 
Phe 
265 
Leu 
280 
Ala 
295 
Val 
310 
Ser 
325 
Ala 
340 



He 
Leu 
Ser 
His 
Cys 
Thr 
Val 
Thr 
Thr 
Ala 
Tyr 
Phe 
Thr 
Thr 
Phe 
Ser 
Ala 
Tyr 
Val 
Gin 



Gly Asn 
His Trp 
Val Val 
Gin Leu 
Thr Leu 
Tyr He 
His Pro 
Leu Val 
Pro Val 
Val Gly 
Trp Phe 
Val Val 
Ser Ser 
Lys Arg 
Val Cys 
He Ser 
He Ser 
He Val 
Lys Pro 
Ala Ala 



Ser 
Cys 
Asp 
Met 
He 
Leu 
He 
He 
Trp 
Cys 
Thr 
He 
Val 
Val 
Trp 
Arg 
Leu 
Leu 
Ala 
Asp 



Thr 
60 
Asn 
75 
Leu 
90 
Gly 
105 
Thr 
120 
Thr 
135 
Ser 
150 
Cys 
165 
Leu 
180 
Gly 
195 
Leu 
210 
Thr 
225 
Ala 
240 
Thr 
255 
Ala 
270 
Pro 
285 
Gly 
300 
Cys 
315 
Ala 
330 
Glu 
345 



<210> 3 

<211> 333 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2214673CD1 



<400> 3 
Met Trp Ser 
1 

Leu Pro Ala 



Cys Ser Trp 
5 

Cys Gin Asp 
20 



Phe Asn Gly 
Leu Gin Leu 



Thr Gly Leu 
10 

Gly Leu Ser 
25 



Val Glu Glu 
15 

Leu Leu Ser 
30 
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Leu 


Leu 


Gly 


Leu 


Val 
35 


Val 


Gly 


Val 


Ala 


Leu 


Leu 


Val 


Leu 
50 


Ala 


Asn 


Leu 


Met 


Pro 


Asp 


Val 


Tyr 
65 


Phe 


Val 


Asn 


Leu 


Ser 


Ala 


Leu 


Ala 
80 


Pro 


Val 


His 


Arg 


Trp 


Ala 


Leu 


Trp 
95 


Ser 


Val 


Gly 


Gin 


He 


Pro 


Phe 


Asn 
110 


Val 


Ser 


Ser 


Ala 


Leu 


Leu 


Ser 


Leu 
125 


Asp 


His 


Tyr 


Thr 


Tyr 


Met 


Ala 


Ser 
140 


Val 


Tyr 


Asn 


Val 


Trp 

XT 


Gly Gly Ala 


Leu 


Leu 


Thr 










155 








Tvr 


He 


Cys 


Ser 


His 
170 


Val 


Ser 


Thr 


Met 


Gin 


Asn 


Ala 


Glu 
185 


Ala 


Ala 


Asp 


Tvr 


Val 


Val 


Pro 


Ala 


Leu 


Ala 


Thr 








200 








Ser 


Arg 


Val 


Arg 


Arg 
215 


Glu 


Asp 


Thr 


Arcr 


Leu 


Glu 


Pro 


Ser 


Ala 


His 


Arg 








230 








Thr 


Gin 


Phe 


Gly 


Leu 
245 


Trp 


Thr 


Pro 


His 


Thr 


Gly 


He 


He 


Ser Arg Gly 










260 








Leu 


Gly 


Leu 


Leu 


His 
275 


Phe 


Val 


Lys 


Phe 


Ser 


Ser 


Ser 


Phe 
290 


Val 


Thr 


Pro 


Gin 


Ser 


Phe 


Pro 


Ser 
305 


Lys 


Leu 


Gin 


Cys Gly Asp Arg 


His 


Cys 


Ser 


Pro 










320 








Val 


Leu 


Ala 













Pro 


Val 


Gly 


Leu 


Cys 


Tyr 


Asn 




40 










45 


His 


Ser 


Lys 


Ala 


Ser 


Met 


Thr 




55 










60 


Met 


Ala 


Val 


Ala 


Gly 


Leu 


Val 




70 










75 


Leu 


Leu 


Gly 


Pro 


Pro 


Ser 


Ser 




85 










90 


Gly Glu 


Val 


His 


Val 


Ala 


Leu 




100 










105 


Leu 


Val 


Ala 


Met 


Tyr 


Ser 


Thr 




115 










120 


He 


Glu 


Arg 


Ala 


Leu 


Pro 


Arg 




130 










135 


Thr 


Arg 


His 


Val 


Cys 


Gly 


Phe 




145 










150 


Ser 


Phe 


Ser 


Ser 


Leu 


Leu 


Phe 




160 










165 


Arg 


Ala 


Leu 


Glu 


Cys 


Ala 


Lys 


175 










180 


Ala 


Thr 


Leu 


Val 


Phe 


He 


Gly 




190 










195 


Leu 


Tyr 


Ala 


Leu 


Val 


Leu 


Leu 




205 










210 


Pro 


Leu 


Asp 


Arg 


Asp 


Thr 


Gly 




220 










225 


Leu 


Leu 


Val 


Ala 


Thr 


Val 


Cys 




235 










240 


His 


Tyr 


Leu 


He 


Leu 


Leu 


Gly 




250 










255 


Lys 


Pro 


Val 


Asp 


Ala 


His 


Tyr 


265 










270 


Asp 


Phe 


Ser 


Lys 


Leu 


Leu 


Ala 


280 










285 


Leu 


Leu 


Tyr Arg 


Tyr 


Met 


Asn 




295 










300 


Arg 


Leu 


Met 


Lys 


Lys 


Leu 


Pro 


310 










315 


Asp 


His 


Met 


Gly Val 


Gin 


Gin 


325 










330 



<210> 4 
<211> 396 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> misc_feature 

<223> Incyte ID No: 2488822CD1 



<400> 4 
Met Phe Arg 
1 

Phe Gin Tyr 



Pro Leu Val 
5 

Cys Gly Tyr 
20 



Asn Leu Ser 
Ala Pro His 



His He Tyr 
10 

Val Arg Ser 
25 



Phe Lys Lys 
15 

Cys Lys Pro 
30 
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Asn 


Thr 


Asp 


Gly 


He 


Ser 


Ser 


Leu 


Glu 


Asn 


Leu 


Leu 


Ala 


Ser 


He 








35 










40 










A C 

4 b 


He 


Gin 


Arg 


Val 


Phe 


Val 


Trp 


Val 


Val 


O /-i -v 

ser 




V ct _L 


1 ill 




Phe 






50 










55 










O U 


Gly 


Asn 


He 


Phe 


Val 
65 


He 


Cys 


Met 


Arg 


Pro 
70 


Tyr 


lie 




Ser 


Glu 
/ b 


Asn 


Lys 


Leu 


Tyr 


Ala 


Met 


Ser 


He 


lie 


Ser 


Leu 






Ala 


Asp 








80 










85 












Cys 


Leu 


Met 


Gly 


He 


Tyr 


Leu 


Phe 


val 


He Gly 


y 


Phe 


Asp 


Leu 








95 










100 










1 nc 


Lys 


Phe 


Arg 


Gly 


Glu 


Xyr 


Asn 


Lys 


His 


Ala 


Gin 


Leu 


Trp 


Mot- 


m n 

\J X u 






110 










115 










TOO 


Ser 


Thr 


His 


Cys 


Gin 


Leu 


Val 


Gly Ser 


Leu 


Ala 


lie 


Leu 


Ser 


Thr 






125 










130 










1J b 


Glu 


Val 


Ser 


Val 


Leu 


Leu 


Leu 


Thr 


Phe 


Leu 


Thr 


Leu 


Glu 


Lys 


Tyr 








140 










145 










1 bU 


He 


Cys 


He 


Val 


Tyr 


Pro 


Phe 


Arg 


Cys 


Val 


Arg 


Pro 


Gly Lys 


Cys 






155 










160 










lob 


Arg 


Thr 


He 


Thr 


Val 
170 


Leu 


He 


Leu 


He 


Trp 
175 


He 


Thr Gly Phe 


He 
180 


Val 


Ala 


Phe 


He 


Pro 


Leu 


Ser 


Asn 


Lys 


Glu 


Phe 


Phe 


Lys 


Asn 


Tyr 








185 










190 










195 


Tyr 


Ala 


Pro 


Asn 


Gly Val 
200 


Cys 


Phe 


Pro 


Leu 
205 


His 


Ser 


Glu 


Asp 


Thr 
210 


Glu 


Ser 


He 


Gly 


Ala 


Gin 


He 


Tyr 


Ser 


Val 


Ala 


lie 


Phe 


Leu Gly 






215 










220 










225 


He 


Asn 


Leu 


Ala 


Ala 


Phe 


He 


lie 


He 


Val 


Phe 


Ser 


Tyr Gly 


Ser 








230 










235 










240 


Met 


Phe 


Tyr 


Ser 


Val 


His 


Gin 


Ser 


Ala 


He 


Thr 


Ala 


Thr 


Glu 


He 






245 










250 










255 


Arg 


Asn 


Gin 


Val 


Lys 


Lys 


Glu 


Met 


He 


Leu 


Ala 


Lys 


Arg 


Phe 


Phe 








260 










265 










270 


Phe 


He 


Val 


Phe 


Thr Asp 


Ala 


Leu Cys 


Trp 


He 


Pro 


He 


Phe 


Val 








275 










280 










285 


Val 


Lys 


Phe 


Leu 


Ser 


Leu 


Leu 


Gin 


Val 


Glu 


He 


Pro Gly Thr 


He 








290 










295 










300 


Thr 


Ser 


Trp 


Val 


Val 


He 


Phe 


He 


Leu 


Pro 


He 


Asn 


Ser 


Ala 


Leu 






305 










310 










315 


Asn 


Pro 


He 


Leu 


Tyr 
320 


Thr 


Leu 


Thr 


Thr 


Arg 
325 


Pro 


Phe 


Lys 


Glu 


Met 
330 


He 


His 


Arg 


Phe 


Trp 


Tyr 


Asn 


Tyr Arg Gin Arg 


Lys 


Ser 


Met 


Asp 








335 










340 










345 


Ser 


Lys Gly Gin 


Lys 


Thr 


Tyr 


Ala 


Pro 


Ser 


Phe 


He 


Trp 


Val 


Glu 










350 










355 










360 


Met 


Trp 


Pro 


Leu 


Gin 


Glu 


Met 


Pro 


Pro 


Glu 


Leu 


Met 


Lys 


Pro 


Asp 








365 










370 










375 


Leu 


Phe 


Thr 


Tyr 


Pro 


Cys 


Glu 


Met 


Ser 


Leu 


He 


Ser 


Gin 


Ser 


Thr 








380 










385 










390 


Arg 


Leu 


Asn 


Ser 


Tyr 
395 


Ser 





















<210> 5 
<211> 403 
<212> PRT 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 2705201CD1 

<400> 5 

Met Phe Val Ala Ser Glu Arg Lys Met Arg Ala His Gin Val Leu 
15 10 15 

Thr Phe Leu Leu Leu Phe Val lie Thr Ser Val Ala Ser Glu Asn 
20 25 30 

Ala Ser Thr Ser Arg Gly Cys Gly Leu Asp Leu Leu Pro Gin Tyr 
35 40 45 

Val Ser Leu Cys Asp Leu Asp Ala lie Trp Gly lie Val Val Glu 
50 55 60 

Ala Val Ala Gly Ala Gly Ala Leu He Thr Leu Leu Leu Met Leu 
65 70 75 

He Leu Leu Val Arg Leu Pro Phe He Lys Glu Lys Glu Lys Lys 
80 85 90 

Ser Pro Val Gly Leu His Phe Leu Phe Leu Leu Gly Thr Leu Gly 
95 100 105 

Leu Phe Gly Leu Thr Phe Ala Phe He He Gin Glu Asp Glu Thr 

110 115 120 

He Cys Ser Val Arg Arg Phe Leu Trp Gly Val Leu Phe Ala Leu 

125 130 135 

Cys Phe Ser Cys Leu Leu Ser Gin Ala Trp Arg Val Arg Arg Leu 

140 145 150 

Val Arg His Gly Thr Gly Pro Ala Gly Trp Gin Leu Val Gly Leu 

155 160 165 

Ala Leu Cys Leu Met Leu Val Gin Val He He Ala Val Glu Trp 

170 175 180 

Leu Val Leu Thr Val Leu Arg Asp Thr Arg Pro Ala Cys Ala Tyr 

185 190 195 

Glu Pro Met Asp. Phe Val Met Ala Leu He Tyr Asp Met Val Leu 

200 205 210 

Leu Val Val Thr Leu Gly Leu Ala Leu Phe Thr Leu Cys Gly Lys 

215 220 225 

Phe Lys Arg Trp Lys Leu Asn Gly Ala Phe Leu Leu He Thr Ala 

230 * 235 240 

Phe Leu Ser Val Leu He Trp Val Ala Trp Met Thr Met Tyr Leu 

245 250 255 

Phe Gly Asn Val Lys Leu Gin Gin Gly Asp Ala Trp Asn Asp Pro 

260 265 270 

Thr Leu Ala He Thr Leu Ala Ala Ser Gly Trp Val Phe Val He 

275 280 285 

Phe His Ala He Pro Glu He His Cys Thr Leu Leu Pro Ala Leu 

290 295 300 

Gin Glu Asn Thr Pro Asn Tyr Phe Asp Thr Ser Gin Pro Arg Met 

305 310 315 

Arg Glu Thr Ala Phe Glu Glu Asp Val Gin Leu Pro Arg Ala Tyr 

320 325 330 

Met Glu Asn Lys Ala Phe Ser Met Asp Glu His Asn Ala Ala Leu 

335 340 345 

Arg Thr Ala Gly Phe Pro Asn Gly Ser Leu Gly Lys Arg Pro Ser 

350 355 360 

Gly Ser Leu Gly Lys Arg Pro Ser Ala Pro Phe Arg Ser Asn Val 

365 370 375 

Tyr Gin Pro Thr Glu Met Ala Val Val Leu Asn Gly Gly Thr He 

380 385 390 
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Pro Thr Ala Pro Pro Ser His Thr Gly Arg His Leu Trp 
395 400 

<210> 6 

<211> 807 

<212> PRT 

<213> Homo sapiens 



<220> 

<221> misc_f eature 

<223> Incyte ID No: 3036563CD1 



<400> 6 

Met Gly Thr Tyr His Cys lie Phe Arg Tyr Lys Asn Ser Tyr Ser 
1 5 10 15 

He Ala Thr Lys Asp Val He Val His Pro Leu Pro Leu Lys Leu 
20 25 30 

Asn He Met Val Asp Pro Leu Glu Ala Thr Val Ser Cys Ser Gly 
35 40 45 

Ser His His He Lys Cys Cys He Glu Glu Asp Gly Asp Tyr Lys 
50 55 60 

Val Thr Phe His Met Gly Ser Ser Ser Leu Pro Ala Ala Lys Glu 
65 70 75 

Val Asn Lys Lys Gin Val Cys Tyr Lys His Asn Phe Asn Ala Ser 
80 85 90 

Ser Val Ser Trp Cys Ser Lys Thr Val Asp Val Cys Cys His Phe 
95 100 105 

Thr Asn Ala Ala Asn Asn Ser Val Trp Ser Pro Ser Met Lys Leu 

110 H5 120 

Asn Leu Val Pro Gly Glu Asn He Thr Cys Gin Asp Pro Val He 

125 130 135 

Gly Val Gly Glu Pro Gly Lys Val He Gin Lys Leu Cys Arg Phe 

140 145 150 

Ser Asn Val Pro Ser Ser Pro Glu Ser Pro He Gly Gly Thr He 

155 160 165 

Thr Tyr Lys Cys Val Gly Ser Gin Trp Glu Glu Lys Arg Asn Asp 

170 175 180 

Cys He Ser Ala Pro He Asn Ser Leu Leu Gin Met Ala Lys Ala 

185 190 195 

Leu He Lys Ser Pro Ser Gin Asp Glu Met Leu Pro Thr Tyr Leu 

200 205 210 

Lys Asp Leu Ser He Ser He Gly Lys Ala Glu His Glu He Ser 

215 220 225 

Ser Ser Pro Gly Ser Leu Gly Ala He He Asn He Leu Asp Leu 

230 235 240 

Leu Ser Thr Val Pro Thr Gin Val Asn Ser Glu Met Met Thr His 

245 250 255 

Val Leu Ser Thr Val Asn He He Leu Gly Lys Pro Val Leu Asn ■ 

260 265 270 

Thr Trp Lys Val Leu Gin Gin Gin Trp Thr Asn Gin Ser Ser Gin 

275 280 285 

Leu Leu His Ser Val Glu Arg Phe Ser Gin Ala Leu Gin Ser Gly 

290 295 300 

Asp Ser Pro Pro Leu Ser Phe Ser Gin Thr Asn Val Gin Met Ser 

305 310 315 

Ser Met Val lie Lys Ser Ser His Pro Glu Thr Tyr Gin Gin Arg 

320 325 330 
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Phe 


Val 


Phe 


Pro 


Tyr 


Phe 


Asp 


Leu 


Trp 


Gly 


Asn 


Val 


Val He 


Asp 










335 










340 








345 


Lys 


Ser 


Tyr 


Leu 


Glu 


Asn 


Leu 


Gin 


Ser 


Asp 


Ser 


Ser 


He Val 


Thr 








350 










355 








360 


Met 


Ala 


Phe 


Pro 


Thr 


Leu 


Gin 


Ala 


He 


Leu 


Ala 


Gin 


Asp He 


Gin 










365 










370 








375 


Glu 


Asn 


Asn 


Phe 


Ala 


Glu 


Ser 


Leu 


Val 


Met 


Thr 


Thr 


Thr Val 


Ser 










380 










385 








390 


His 


Asn 


Thr 


Thr 


Met 


Pro 


Phe 


Arg 


He 


Ser 


Met 


Thr 


Phe Lys 


Asn 










395 










400 








405 


Asn 


Ser 


Pro 


Ser Gly Gly Glu 


Thr 


Lys 


Cys 


Val 


Phe 


Trp Asn 


Phe 










410 










415 








420 


Arg 


Leu 


Ala Asn Asn Thr Gly Gly Trp Asp 


Ser 


Ser 


Gly Cys 


Tyr 








425 










430 








435 


Val 


Glu 


Glu Gly Asp Gly Asp 


Asn 


Val 


Thr 


Cys 


He 


Cys Asp 


His 










440 










445 








450 


Leu 


Thr 


Ser 


Phe 


Ser 


He 


Leu 


Met 


Ser 


Pro 


Asp 


Ser 


Pro Asp 


Pro 










455 










460 








465 


Ser 


Ser 


Leu 


Leu Gly 


He 


Leu 


Leu Asp 


He 


He 


Ser 


Tyr Val 


Gly 










470 










475 








480 


Val 


Gly 


Phe 


Ser 


lie 


Leu 


Ser 


Leu 


Ala 


Ala 


Cys 


Leu 


Val Val 


Glu 








485 










490 








495 


Ala 


Val 


Val 


Trp 


Lys 


Ser 


Val 


Thr 


Lys 


Asn 


Arg 


Thr 


Ser Tyr 


Met 










500 










505 








510 


Arg 


His 


Thr 


Cys 


He 


Val 


Asn 


He 


Ala 


Ala 


Ser 


Leu 


Leu Val 


Ala 








515 










520 








525 


Asn 


Thr 


Trp 


Phe 


He 


Val 


Val 


Ala 


Ala 


He 


Gin 


Asp Asn Arg 


Tyr 










530 










535 








540 


He 


Leu 


Cys 


Lys 


Thr 


Ala 


Cys 


Val 


Ala 


Ala 


Thr 


Phe 


Phe He 


His 










545 










550 








555 


Phe 


Phe 


Tyr 


Leu 


Ser 


Val 


Phe 


Phe 


Trp 


Met 


Leu 


Thr Leu Gly 


Leu 










560 










565 








570 


Met 


Leu 


Phe 


Tyr 


Arg 


Leu 


Val 


Phe 


He 


Leu 


His 


Glu 


Thr Ser 


Arg 










575 










580 








585 


Ser 


Thr 


Gin 


Lys 


Ala 


He 


Ala 


Phe 


Cys 


Leu Gly Tyr Gly Cys 


Pro 










590 










595 








600 


Leu 


Ala 


He 


Ser 


Val 


He Thr Leu Gly Ala 


Thr 


Gin 


Pro Arg 


Glu 










605 










610 








615 


Val 


Tyr 


Thr Arg 


Lys 


Asn 


Val 


Cys 


Trp 


Leu 


Asn 


Trp 


Glu Asp 


Thr 










620 










625 








630 


Lys 


Ala 


Leu 


Leu 


Ala 


Phe 


Ala 


He 


Pro 


Ala 


Leu 


He 


He Val 


Val 








635 










640 








645 


Val 


Asn 


He 


Thr 


He 


Thr 


He 


Val 


Val 


He 


Thr 


Lys 


He Leu 


Arg 










650 










655 








660 


Pro 


Ser 


He Gly Asp 


Lys 


Pro 


Cys 


Lys 


Gin 


Glu 


Lys 


Ser Ser 


Leu 










665 










670 








675 


Phe 


Gin 


He 


Ser 


Lys 


Ser 


He Gly Val 


Leu 


Thr 


Pro 


Leu Leu 


Gly 










680 










685 








690 


Leu 


Thr 


Trp Gly Phe Gly Leu Thr Thr 


Val 


Phe 


Pro Gly Thr 


Asn 










695 










700 








705 


Leu 


Val 


Phe 


His 


He 


He 


Phe 


Aia 


He 


Leu 


Asn 


Val 


Phe Gin 


Gly 










710 










715 








720 


Leu 


Phe 


He 


Leu 


Leu 


Phe Gly Cys 


Leu 


Trp 


Asp 


Leu 


Lys Val 


Gin 










725 










730 








735 


Glu 


Ala 


Leu 


Leu 


Asn 


Lys 


Phe 


Ser 


Leu 


Ser 


Arg 


Trp 


Ser Ser 


Gin 










740 










745 








750 


His 


Ser 


Lys 


Ser 


Thr 


Ser Leu Gly 


Ser 


Ser 


Thr 


Pro 


Val Phe 


Ser 



8/13 



WO 00/15793 



PCT/US99/20958 



Leu Glu Asn 



Thr Gly Thr 



Met Ser Ser 



755 
Pro lie 

770 
Tyr Asn 

785 
Ser Ser 

800 



Val Ser Thr Pro 



Ser Ala Ser Ser 



Ser Arg Arg Phe 



760 765 
Asn Asn Leu Phe Gly Lys 
775 780 
Glu Ala Thr Ser Ser Ser 
790 795 
Leu Leu Asn 



805 



<210> 7 

<211> 1824 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 1258981CB1 

<400> 7 

cggctcgagc cctcaccagc cggaaagtac gagtcggctc agcctggagg gacccaacca 60 
gagcctggcc tgggagccag gatggccatc cacaaagcct tggtgatgtg cctgggactg 120 
cctctcttcc tgttcccagg ggcctgggcc cagggccatg tcccacccgg ctgcagccaa 180 
ggcctcaacc ccctgtacta caacctgtgt gaccgctctg gggcgtgggg catcgtcctg 240 
gaggccgtgg ctggggcggg cattgtcacc acgtttgtgc tcaccatcat cctggtggcc 300 
agcctcccct ttgtgcagga caccaagaaa cggagcctgc tggggaccca ggtattcttc 360 
cttctgggga ccctgggcct cttctgcctc gtgtttgcct gtgtggtgaa gcccgacttc 420 
tccacctgtg cctctcggcg cttcctcttt ggggttctgt tcgccatctg cttctcttgt 480 
ctggcggctc acgtctttgc cctcaacttc ctggcccgga agaaccacgg gccccggggc 540 
tgggtgatct tcactgtggc tctgctgctg accctggtag aggtcatcat caatacagag 600 
tggctgatca tcaccctggt tcggggcagt ggcgagggcg gccctcaggg caacagcagc 660 
gcaggctggg ccgtggcctc cccctgtgcc atcgccaaca tggactttgt catggcactc 720 
atctacgtca tgctgctgct gctgggtgcc ttcctggggg cctggcccgc cctgtgtggc 780 
cgctacaagc gctggcgtaa gcatggggtc tttgtgctcc tcaccacagc cacctccgtt 840 
gccatatggg tggtgtggat cgtcatgtat acttacggca acaagcagca caacagtccc 900 
acctgggatg accccacgct ggccatcgcc ctcgccgcca atgcctgggc cttcgtcctc 960 
ttctacgtca tccccgaggt ctcccaggtg accaagtcca gcccagagca aagctaccag 1020 
ggggacatgt accccacccg gggcgtgggc tatgagacca tcctgaaaga gcagaagggt 1080 
cagagcatgt tcgtggagaa caaggccttt tccatggatg agccggttgc agctaagagg 114 0 
ccggtgtcac catacagcgg gtacaatggg cagctgctga ccagtgtgta ccagcccact 1200 
gagatggccc tgatgcacaa agttccgtcc gaaggagctt acgacatcat cctcccacgg 1260 
gccaccgcca acagccaggt gatgggcagt gccaactcga ccctgcgggc tgaagacatg 1320 
tactcggccc agagccacca ggcggccaca ccgccgaaag acggcaagaa ctctcaggtc 1380 
tttagaaacc cctacgtgtg ggactgagtc agcggtggcg aggagaggcg gtcggatttg 1440 
gggagggccc tgaggacctg gccccgggca agggactctc caggctcctc ctccccctgg 1500 
caggcccagc aacatgtgcc ccagatgtgg aagggcctcc ctctctgcca gtgtttgggt 1560 
gggtgtcatg ggtgtcccca cccactcctc agtgtttgtg gagtcgagga gccaacccca 1620 
gcctcctgcc aggatcacct cggcggtcac actccagcca aatagtgttc tcggggtggt 1680 
ggctgggcag cgcctatgtt tctctggaga ttcctgcaac ctcaagagac ttcccaggcg 1740 
ctcaggcctg gatcttgctc ctctgtgagg aacaagggtg cctaataaat acatttctgc 1800 
tttattaact cttaaaaaaa aaaa 1824 

<210> 8 

<211> 2152 

<212> DNA 

<213> Homo sapiens 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 1459432CB1 

<400> 8 

ttatgtctgg tcgactctga attgggcttg gaggcggcac ggctgccagg ctacggaggt 60 
agaccccctt cccaactgcg gggcttgcgc tccgggacaa ggtggcaggc gctggaggct 120 
gccgcagcct gcgtgggtgg aggggagctc agctcggttg tggcagcatg cgaccggcac 180 
tggctggatg gacctggaag cctcgctgct gcccactggt cccaatgcca gcaacacctc 240 
tgatggcccc gataacctca cttcggcagg atcacctcct cgcacgggga gcatctccta 300 
catcaacatc atcatgcctt cggtgttcgg caccatctgc ctcctgggca tcatcgggaa 360 
ctccacggtc atcttcgcgg tcgtgaagaa gtccaagctg cactggtgca acaacgtccc 42 0 
cgacatcttc atcatcaacc tctcggtagt agatctcctc tttctcctgg gcatgccctt 480 
catgatccac cagctcatgg gcaatggggt gtggcacttt ggggagacca tgtgcaccct 540 
catcacggcc atggatgcca atagtcagtt caccagcacc tacatcctga ccgccatggc 600 
cattgaccgc tacctggcca ctgtccaccc catctcttcc acgaagttcc ggaagccctc 660 
tgtggccacc ctggtgatct gcctcctgtg ggccctctcc ttcatcagca tcacccctgt 720 
gtggctgtat gccagactca tccccttccc aggaggtgca gtgggctgcg gcatacgcct 780 
gcccaaccca gacactgacc tctactggtt caccctgtac cagtttttcc tggcctttgc 840 
cctgcctttt gtggtcatca cagccgcata cgtgaggatc ctgcagcgca tgacgtcctc 900 
agtggccccc acctcccagc gcagcatccg gctgcggaca aagagggtga cccgcacagc 960 
catcgccatc tgtctggtct tctttgtgtg ctgggcaccc tactatgtgc tacagctgac 1020 
ccagttgtcc atcagccgcc cgacccccac ctttgtctac ttatacaatg cggccatcag 1080 
cttgggctat gccaacagct gcctcaaccc gtttgtgtac atcgtgctct gtgagacgtt 1140 
ccgcaaacgc ttggtcctgt cggtgaagcc tgcagcccag gggcagcttc gcgctgtcag 1200 
caacgctcag gcggctgacg aggagaggac agaaagcaaa ggcacctgat acttcccctg 1260 
ccaccctgca cacctccaag tcagggcacc acaacacgcc accgggagag atgctgagaa 1320 
aaacccaaga ccgctcggga aatgcaggaa ggccgggttg tgaggggttg ttgcaatgaa 13 80 
ataaatacat tccatggggc tcacacgttg ctggggaggc ctggagtcag gtttggggtt 1440 
ttcagatatc agaaatcccc ttgggggagc aggatgagac ctttggatag aacagaagct 1500 
gagcaagaga acatgttggt ttggataacc ggttgcacta tatctgtgag ctctcaaatg 1560 
tcttcttccc aaggcaagag gtggaagggt actgactggg tttgtttaaa gtcaggcagg 1620 
gctggagtga gcagccaggg ccatgttgca caaggcctga gagacgggaa agggcccgat 168 0 
cgctctttcc cgcctctcac tggtgcgatg gaaggtggcc tttctcccaa gctggtggat 174 0 
aatgaaaaat aaagcatccc atctctcggc gttccagcat cctgtcaatt tcccttttgc 1800 
tctagaggat gcatgtttat ttgaggggat gtggcactga gcccacagga gtaaaagccc 1860 
agtttgctag gaggtctgct tactgaaaac aaggagacct ggggtgggtg tggttggggg 1920 
tcttaaaact aataaaagct ggggtcgggg ggcttttgca gctctggtga cattctctcc 198 0 
acggggcaca tttgctcagt cactaatcca gcttgagtgt ccgtgtgttc tgcatgtgca 2040 
ggggtcattc tagtgcccgg tgtgttggca tcatcttttt gctctagccc ttcctctcca 2100 
aaataaaatc aaataaagga aaatctccac ccaaaaaaaa aaaaaaaaaa gg 2152 

<210> 9 

<211> 1878 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_feature 

<223> Incyte ID No: 2214673CB1 

<400> 9 

cgcacagcgc gcaggtcctc accagagctc tggtggccac ctctgtcccg ccatgctgct 6 0 
caccgacagt ggccagggcc cacagcacca agaggcttgg gccacaaagt aaagggtcgc 12 0 
ggagcctcgc cggccgccat gtggagctgc agctggttca acggcacagg gctggtggag 18 0 
gagctgcctg cctgccagga cctgcagctg gggctgtcac tgttgtcgct gctgggcctg 240 
gtggtgggcg tgccagtggg cctgtgctac aacgccctgc tggtgctggc caacctacac 300 



10/13 



WO 00/15793 



PCT/US99/20958 



agcaaggcca gcatgaccat gccggacgtg tactttgtca acatggcagt ggcaggcctg 360 

gtgctcagcg ccctggcccc tgtgcacctg ctcggccccc cgagctcccg gtgggcgctg 420 

tggagtgtgg gcggcgaagt ccacgtggca ctgcagatcc ccttcaatgt gtcctcactg 480 

gtggccatgt actccaccgc cctgctgagc ctcgaccact acatcgagcg tgcactgccg 540 

cggacctaca tggccagcgt gtacaacacg cggcacgtgt gcggcttcgt gtggggtggc 600 

gcgctgctga ccagcttctc ctcgctgctc ttctacatct gcagccatgt gtccacccgc 660 

gcgctagagt gcgccaagat gcagaacgca gaagctgccg acgccacgct ggtgttcatc 720 

ggctacgtgg tgccagcact ggccaccctc tacgcgctgg tgctactctc ccgcgtccgc 780 

agggaggaca cgcccctgga ccgggacacg ggccggctgg agccctcggc acacaggctg 840 

ctggtggcca ccgtgtgcac gcagtttggg ctctggacgc cacactatct gatcctgctg 900 

gggcacacgg gcatcatctc gcgagggaag cccgtggacg cacactacct ggggctactg 960 

cactttgtga aggatttctc caaactcctg gccttctcca gcagctttgt gacaccactt 1020 

ctctaccgct acatgaacca gagcttcccc agcaagctcc aacggctgat gaaaaagctg 1080 

ccctgcgggg accggcactg ctccccggac cacatggggg tgcagcaggt gctggcgtag 1140 

gcggcccagc cctcctgggg agacgtgact ctggtggacg cagagcactt agttaccctg 1200 

gacgctcccc acatccttcc agaaggagac gagctgctgg aagagaagca ggaggggtgt 1260 

ttttcttgaa gtttcctttt tcccacaaat gccactcttg ggccaaggct gtggtccccg 1320 

tggctggcat ctggcttgag tctccccgag gcctgtgcgt ctcccaaaca cgcagctcaa 13 80 

ggtccacatc cgcaaaagcc tcctcgcctt cagcctcctc agcattcagt ttgtcaatga 1440 

agtgatgaaa gcttagagcc agtatttata ctttgtggtt aaaatacttg attccccctt 1500 

gtttgtttta caaaaacaga tgtttcctag aaaaatgaca aatagtaaaa tgaacaaaac 1560 

cctacgaaag aatggcaaca gccagggtgg ccgggccctg ccagtgggcg gcgtgtgcta 1620 

gcaaggcctg ccgggtgtgc cgcagtcacc acagggttct gagaacattt cacagaagtg 1680 

cctgagacgc ggagacatgg ctggtgttaa atggagctat tcaatagcag tgacgcgctc 1740 

tcctcagcca ccaaatgtcc ctgacaccct ccccagcccc cacagataac atcagctgag 1800 

gtttttttca gtatgaacct gtcctaaatc aattcctcaa agtgtgcaca aaactaaaga 1860 
atataaataa acagaagc 1878 

<210> 10 

<211> 1804 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> misc_f eature 

<223> Incyte ID No: 2488822CB1 

<400> 10 

taagtgttaa ctaaaagcat 
catgtataat acatgttcct 
ggattgaaat ttcaaatatc 
tatattttaa gaaattccag 
acactgatgg aatttcatct 
tctgggttgt atctgcagtt 
atatcaggtc tgagaacaag 
gcttaatggg aatatattta 
acaataagca tgcgcagctg 
ccattctgtc cacagaagta 
tctgcattgt ctatcctttt 
tgattctcat ttggattact 
ttttcaaaaa ctactatgca 
aaagtattgg agcccagatt 
ttatcatcat agttttttcc 
cagcaactga aatacggaat 
ttatagtatt tactgatgca 
tgcttcaggt agaaatacca 
ttaacagtgc tttgaaccca 



tttattaaat tgtccttcac 
ttgattgatt attaatttga 
caacaaacjga tgtttagacc 
tactgtgggt atgcaccaca 
ctagagaatc tcttggcaag 
acctgctttg gaaacatttt 
ctgtatgcca tgtcaatcat 
ttcgtgatcg gaggctttga 
tggatggaga gtactcattg 
tcagttttac tgttaacatt 
agatgtgtga gacctggaaa 
ggttttatag tggctttcat 
cccaatggag tatgcttccc 
tattcagtgg caatttttct 
tatggaagca tgttttatag 
caagttaaaa aagagatgat 
ttatgctgga tacccatttt 
ggtaccataa cctcttgggt 
attctctata ctctgaccac 



agaaactcaa tttattaaac 60 
tatttttagc agcctagaag 120 
tcttgtgaat ctctctcaca 180 
tgttcgcagc tgtaaaccaa 240 
cattattcag agagtatttg 300 
tgtcatttgc atgcgacctt 36 0 
ttctctctgc tgtgccgact 420 
cctaaagttt cgtggagaat 480 
tcagcttgta ggatctttgg 54 0 
tctgacattg gaaaaataca 600 
atgcagaaca attacagttc 660 
tccattgagc aataaggaat 720 
tcttcattca gaagatacag 780 
tggtattaat ttggccgcat 84 0 
tgttcatcaa agtgccataa 900 
ccttgccaaa cgttttttct 960 
tgtagtgaaa tttctttcac 1020 
agtgattttt attctgccca 1080 
aagaccattt aaagaaatga 1140 
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ttcatcggtt ttggtataac tacagacaaa 

catatgctcc atcattcatc tgggtggaaa 

taatgaagcc ggaccttttc acatacccct 

gactcaattc ctattcatga ctgactctga 

ggtgcttcat gagggattta ctggtatgaa 

agctaagata aatattttac aaggacatga 

aagggaagta attatatcaa taatgtatat 

agagaaatct acttcagtaa cattcattca 

ctactatgtg catagcattg caatatagtc 

ctgtagatgg tgtttaatga caaaagacta 

agtagagctt tacctgtcat gtgcatcagc 
ggac 



gaaaatctat ggacagcaaa ggtcagaaaa 1200 
tgtggccact gcaggagatg ccacctgagt 1260 
gtgaaatgtc actgatttct caatcaacga 1320 
aattcatttc ttcgcagaga atactgtggg 1380 
atgaatacca caaaattaat ttataataat 1440 
ggaaaaataa aaatgactaa tgctcttaca 1500 
atattagtag acattttgca taagaaatta 1560 
tttttctaac atgcatttat tgagtaccca 1620 
ctggaagtag acagtgcaga acctttcaat 1680 
tacaaagtcc atctgcagtt cctagtttaa 1740 
aagaatcata gcgattttaa atagaggtgt 1800 

1804 



<210> 11 

<211> 1520 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> misc_feature 

<223> Incyte ID No: 2705201CB1 



<400> 11 

tgccgaagag tctggagcgt cggcgctgcg 
agagagaaag atgagagctc accaggtgct 
ggtggcctct gaaaacgcca gcacatcccg 
cgtgtccctg tgcgacctgg acgccatctg 
gggcgccctg atcacactgc tcctgatgct 
ggagaaggag aagaagagcc ctgtgggcct 
cctctttggg ctgacgtttg ccttcatcat 
ccgcttcctc tggggcgtcc tctttgcgct 
gcgcgtgcgg aggctggtgc ggcatggcac 
ggcgctgtgc ctgatgctgg tgcaagtcat 
gctgcgtgac acaaggccag cctgcgccta 
ctacgacatg gtactgcttg tggtcaccct 
gttcaagagg tggaagctga acggggcctt 
catctgggtg gcctggatga ccatgtacct 
tgcctggaac gaccccacct tggccatcac 
cttccacgcc atccctgaga tccactgcac 
caactacttc gacacgtcgc agcccaggat 
gctgccgcgg gcctatatgg agaacaaggc 
ccgaacagca ggatttccca acggcagctt 
aagacccagc gctccgttta gaagcaacgt 
caacggtggg accatcccaa ctgctccgcc 
gactttaagt tccagagaat cagaatttct 
tcttgaggga gaaatcggta acagttgccg 
gaaatcctag ccaaggggat ttcgtgtaaa 
ccgactgccc gcccctcccc tgccacacac 
ccccacctta aaaaaaaaaa 

<210> 12 

<211> 2919 

<212> DNA 

<213> Homo sapiens 



gggccgcggg ggtcgaatgt tcgtggcatc 60 
caccttcctc ctgctcttcg tgatcacctc 120 
aggctgtggg ctggacctcc tccctcagta 180 
gggcattgtg gtggaggcgg tggccggggc 24 0 
catcctcctg gtgcggctgc ccttcatcaa 3 00 
ccactttctg ttcctcctgg ggaccctggg 360 
ccaggaggac gagaccatct gctctgtccg 420 
ctgcttctcc tgcctgctga gccaggcatg 480 
gggccccgcg ggctggcagc tggtgggcct 54 0 
catcgctgtg gagtggctgg tgctcaccgt 600 
cgagcccatg gactttgtga tggccctcat 660 
ggggctggcc ctcttcactc tgtgcggcaa 720 
cctcctcatc acagccttcc tctctgtgct 780 
cttcggcaat gtcaagctgc agcaggggga 840 
gctggcggcc agcggctggg tcttcgtcat 900 
ccttctgcca gccctgcagg agaacacgcc 960 
gcgggagacg gccttcgagg aggacgtgca 1020 
cttctccatg gatgaacaca atgcagctct 1080 
gggaaaaaga cccagtggca gcttggggaa 114 0 
gtatcagcca actgagatgg ccgtcgtgct 1200 
aagtcacaca ggaagacacc tttggtgaaa 1260 
cttaccgatt tgcctccctg gctgtgtctt 1320 
aaccaggccg cctcacagcc aggaaatttg 1380 
tgtgaacact gacgaactga aaagctaaca 1440 
acagacacgt aataccagac caacctcaat 1500 

1520 
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<220> 

<221> misc_f eature 

<223> Incyte ID No: 3036563CB1 

<400> 12 

atcttgatgg agcagaatca gtactgacag tcaagacctc gaccagggag tggaatggga 60 
acctatcact gcatatttag atataagaat tcatacagta ttgcaaccaa agacgtcatt 120 
gttcacccgc tgcctctaaa gctgaacatc atggttgatc ctttggaagc tactgtttca 180 
tgcagtggtt cccatcacat caagtgctgc atagaggagg atggagacta caaagttact 240 
ttccatatgg gttcctcatc ccttcctgct gcaaaagaag ttaacaaaaa acaagtgtgc 300 
tacaaacaca atttcaatgc aagctcagtt tcctggtgtt caaaaactgt tgatgtgtgt 360 
tgtcacttta ccaatgctgc taataattca gtctggagcc catctatgaa gctgaatctg 420 
gttcctgggg aaaacatcac atgccaggat cccgtaatag gtgtcggaga gccggggaaa 480 
gtcatccaga agctatgccg gttctcaaac gttcccagca gccctgagag tcccattggc 540 
gggaccatca cttacaaatg tgtaggctcc cagtgggagg agaagagaaa tgactgcatc 600 
tctgccccaa taaacagtct gctccagatg gctaaggctt tgatcaagag cccctctcag 660 
gatgagatgc tccctacata cctgaaggat ctttctatta gcataggcaa agcggaacat 720 
gaaatcagct cttctcctgg gagtctggga gccattatta acatccttga tctgctctca 780 
acagttccaa cccaagtaaa ttcagaaatg atgacgcacg tgctctctac ggttaatatc 840 
atccttggca agcccgtctt gaacacctgg aaggttttac aacagcaatg gaccaatcag 900 
agttcacagc tactacattc agtggaaaga ttttcccaag cattacagtc aggagatagc 960 
cctccattgt ccttctccca aactaatgtg cagatgagca gcatggtaat caagtccagc 102 0 
cacccagaaa cctatcaaca gaggtttgtt ttcccatact ttgacctctg gggcaatgtg 108 0 
gtcattgaca agagctacct agaaaacttg cagtcggatt cgtctattgt caccatggct 1140 
ttcccaactc tccaagccat ccttgctcag gatatccagg aaaataactt tgcagagagc 1200 
ttagtgatga caaccactgt cagccacaat acgactatgc cattcaggat ttcaatgact 1260 
tttaagaaca atagcccttc aggcggcgaa acgaagtgtg tcttctggaa cttcaggctt 132 0 
gccaacaaca caggggggtg ggacagcagt gggtgctatg ttgaagaagg tgatggggac 1380 
aatgtcacct gtatctgtga ccacctaaca tcattctcca tcctcatgtc ccctgactcc 1440 
ccagatccta gttctctcct gggaatactc ctggatatta tttcttatgt tggggtgggc 1500 
ttttccatct tgagcttggc agcctgtcta gttgtggaag ctgtggtgtg gaaatcggtg 1560 
accaagaatc ggacttctta tatgcgccac acctgcatag tgaatatcgc tgcctccctt 1620 
ctggtcgcca acacctggtt cattgtggtc gctgccatcc aggacaatcg ctacatactc 1680 
tgcaagacag cctgtgtggc tgccaccttc ttcatccact tcttctacct cagcgtcttc 1740 
ttctggatgc tgacactggg cctcatgctg ttctatcgcc tggttttcat tctgcatgaa 1800 
acaagcaggt ccactcagaa agccattgcc ttctgtcttg gctatggctg cccacttgcc 1860 
atctcggtca tcacgctggg agccacccag ccccgggaag tctatacgag gaagaatgtc 1920 
tgttggctca actgggagga caccaaggcc ctgctggctt tcgccatccc agcactgatc 198 0 
attgtggtgg tgaacataac catcactatt gtggtcatca ccaagatcct gaggccttcc 2040 
attggagaca agccatgcaa gcaggagaag agcagcctgt ttcagatcag caagagcatt 2100 
ggggtcctca caccactctt gggcctcact tggggttttg gtctcaccac tgtgttccca 2160 
gggaccaacc ttgtgttcca tatcatattt gccatcctca atgtcttcca gggattattc 2220 
attttactct ttggatgcct ctgggatctg aaggtacagg aagctttgct gaataagttt 2280 
tcattgtcga gatggtcttc acagcactca aagtcaacat ccctgggttc atccacacct 2340 
gtgttttcta tgagttctcc aatatcaagg agatttaaca atttgtttgg taaaacagga 2400 
acgtataatg tttccacccc agaagcaacc agctcatccc tggaaaactc atccagtgct 2460 
tcttcgttgc tcaactaaga acaggataat ccaacctacg tgacctcccg gggacagtgg 2520 
ctgtgctttt aaaaagagat gcttgcaaag caatggggaa cgtgttctcg gggcaggctt 2580 
ccgggagcag atgccaaaaa gactttttca tagagaagag gctttctttt gtaaagacag 2640 
aataaaaata attgttatgt ttctgtttgt tccctccccc tcccccttgt gtgataccac 2700 
atgtgtatag tatttaagtg aaactcaagc cctcaaggcc caacttctct gtctatattg 2 760 
taatatagaa tttcgaagag acattttcac tttttacaca ttgggcacaa agataagctt 2820 
tgattaaagt agtaagtaaa aggctaccta ggaaatactt cagtgaattc taagaaggaa 2880 
ggaaggaagg aaggagggaa agaagggagg aaaccagga 2919 
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